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Preface 




Photonics (also known as optoelectronics) is the technology of creation, transmission, 
detection, control and applications of light. It has many applications in various areas of 
science and engineering fields. Fibre optic communication is an important part of photonics. 
It uses light particles (photons) to carry information over optical fibre. 

In the last 20 years we have witnessed the significant (and increasing) presence of photon¬ 
ics in our everyday life. The creation of the Internet and World Wide Web was possible due 
to tremendous technical progress created by photonics, development of photonic devices, 
improvement of optical fibre, wavelength division multiplexing (WDM) techniques, etc. 
The phenomenal growth of the Internet owes a lot to the field of photonics and photonic 
devices in particular. 

This book serves as an attempt to introduce graduate students and senior undergraduates 
to the issues of computational photonics. The main motivation for developing an approach 
described in the present book was to establish the foundations needed to understand prin¬ 
ciples and devices behind photonics. 

In this book we advocate a simulation-type approach to teach fundamentals of photonics. 
We provide a self-contained development which includes theoretical foundations and also 
the MATLAB code aimed at detailed simulations of real-life devices. 

We emphasize the following characteristics of our very practical book: 

• learning through computer simulations 

• writing and analysing computer code always gives good sense of the values of all param¬ 
eters 

• our aim was to provide complete theoretical background with only basic knowledge 
assumed 

• the book is self-contained in a sense that it starts from a very basic knowledge and ends 
with the discussion of several hot topics. 

The author believes that one can learn a lot by studying not only a theory but also by 
performing numerical simulations (numerical experiments) using commercial software or 
developed in-house. Here, we adopt a view that the best way will be to develop a software 
ourselves (as opposed to commercial). Also there is one more component in the learning 
process, namely real-life experimentation. That part is outside the scope of the present 
book. 

The goal was to equip students with solid foundations and underlying principles by 
providing a theoretical background and also by applying that theory to create simple 
programs in MATLAB which illustrate theoretical concepts and which allow students to 
conduct their own numerical experiments in order to get better understanding. 
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Alternatively, students can also use octave (www.octave.org) to run and to conduct 
numerical experiments using programs developed and distributed with this book. At the 
end, they can try to develop large computer programs by combining learned ideas. At this 
stage, I would like to remind the reader that according to Donald Knuth (cited in Computer 
Algebra Handbook, J. Grabmeier, E. Kaltofen and V Weispfenning, eds., Springer, 2003) 
the tasks of increasing difficulty are: 

i) publishing a paper 

ii) publishing a book 

iii) writing a large computer program. 

The book has evolved from the lecture notes delivered in the Department of Electrical 
Engineering, University of Oulu, Finland, Institute of Physics, Wroclaw University of 
Technology and Department of Physics and Computer Science, Wilfrid Laurier University, 
Waterloo, Canada. 

The book has been designed to serve a 12-week-long term with several extra chapters 
which allow the instructor more flexibility in choosing the material. An important part of 
the book is the MATLAB code provided to solve problems. Students can modify it and 
experiment with different parameters to see for themselves the role played by different 
factors. This is a very valuable element of the learning process. 

The book also contains a small number of solved problems (labelled as Examples) which 
are intended to serve as a practical illustration of discussed topics. At the end of each 
chapter there is a number of problems intended to test the student’s understanding of the 
material. 

The book may serve as a reference for engineers, physicists, practising scientists and 
other professionals. It concentrates on operating principles of optical devices, as well as their 
models and numerical methods used for description. The covered material includes fibre, 
planar waveguides, laser diodes, detectors, optical amplifiers and semiconductor optical 
amplifiers, receivers, beam propagation method, some wavelength division devices, finite- 
difference time-domain method, linear and nonlinear pulses, solar cells and metamaterials. 

The book can also be used as a textbook as it contains questions, problems (both solved 
and unsolved), working examples and projects. It can be used as a teaching aid at both under¬ 
graduate and graduate levels as a one-semester course in physics and electrical engineering 
departments. 

The book contains 17 chapters plus two appendices. The first two chapters summarize 
basic topics in optics and electromagnetism. The book also contains a significant number 
of software exercises written in MATLAB, designed to enhance and help understanding the 
discussed topics. Many proposed projects have been designed with a view to implement 
them in MATLAB. 

Some numerical methods as well as important practical devices were not considered. 
Those developments will be included in the second edition. The author would like to wish 
a potential reader the similar joy of reading the book and experimenting with programs as 
he had with writing it. 

Finally, I welcome all type of comments from readers, especially concerning errors, 
inaccuracies and omissions. Please send your comments to mwartak@wlu.ca. 
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xiii 


Requirements I do not assume any particular knowledge since I tried to cover all basic 
topics. Some previous exposure to optics and electromagnetic theory will be helpful. 
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Introduction 




In this introductory chapter we will try to define computational photonics and to position it 
within a broad field of photonics. We will briefly summarize several subfields of photonics 
(with the main emphasis on optical fibre communication) to indicate potential possibilities 
where computational photonics can significantly contribute by reducing cost of designing 
new devices and speeding up their development. 


1.1 What is photonics? 


We start our discussion from a broader perspective by articulating what photonics is, what 
the current activities are and where one can get the most recent information. 

Photonics is the field which involves electromagnetic energy, such as light, where the 
fundamental object is a photon. In some sense, photonics is parallel to electronics which 
involves electrons. Photonics is often referred to as optoelectronics, or as electro-optics to 
indicate that both fields have a lot in common. In fact, there is a lot of interplay of photonics 
and electronics. For example, a laser is driven by electricity to produce light or to modulate 
that light to transmit data. 

Photonics applications use the photon in a similar way to that which electronic applica¬ 
tions use the electron. However, there are several advantages of optical transmission of data 
over electrical. Furthermore, photons do not interact between themselves (which is both 
good and bad), so electromagnetic beams can pass through each other without interacting 
and/or causing interference. 

Even with the telecommunication ‘bubble’ which happened some ten years ago, the 
subfield of photonics, namely optical fibre communication, is still a very important segment 
of photonics activities. As an example, a single optical fibre has the capacity to carry about 
3 million telephone calls simultaneously. But, due to a crisis around 2000, many new 
applications of photonics have emerged or got noticed. Bio-photonics or medical photonics 
are amongst the most important ones. 

Before we try to define computational photonics, let us concentrate for a moment on 
what photonics is at the time of writing (Winter, 2011). In the following paragraphs, we 
cite some information from the relevant conferences. 

Some of the well-established conferences which summarize research and applications 
in a more traditional approach to photonics, and also dictate new directions are: Photonics 
West, held every January in California (here is the 2011 information [1]); Photonics East, 
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taking place in the eastern part of the USA in the Fall; Optical Fibre Conference (OFC) 
taking place in March every year [2], 

A new conference, the Photonics Global Conference (PGC), is a biennial event held 
since 2008 [3]. The aim of this conference is to foster interactions among broad disciplines 
in photonics with concentration on emerging directions. Symposia and Special Sessions 
for 2011 are summarized below: 

• Symposia: 1. Optofluidics and Biophotonics. 2. Fibre-Based Devices and Applications. 
3. Green Photonics. 4. High-Power Lasers and Their Industrial Applications. 5. Metama¬ 
terials and Plasmonics. 6. Nanophotonics. 7. Optical Communications and Networks. 

• Special Sessions: 1. Quantum Communications. 2. Photonics Crystal Fibres and Their 
Applications. 3. Photonics Applications of Carbon Nanotube and Graphene. 4. Terahertz 
Technology. 5. Diffuse Optical Imaging. 

By looking at the discussed topics, one can get some sense about current activities in 
photonics. 


1.2 What is computational photonics? 


Computational photonics is a branch of physics which uses numerical methods to study 
properties and propagation of light in waveguiding structures. (Here, light is used in a broad 
sense as a replacement for electromagnetic waves.) Within this field, an important part is 
played by studies of the behaviour of light and light-matter interactions using analytical 
and computational models. This emerging field of computational science is playing a 
critically important role in designing new generations of integrated optics modules, long 
haul transmission and telecommunication systems. 

Generally, computational photonics is understood as the ‘replacement’ of the experimen¬ 
tal method, where one is performing all the relevant ‘experiments’ on computer. Obviously, 
such an approach reduces development cost and speeds-up development of new products. 

We will attempt to cover some of those developments. The field is, however, so broad 
that it is impossible to review all of its activities. Naturally, selection of the topics reflects 
author’s expertise. 

As a separate topic in photonics, one selects integrated photonics where the concentration 
is on waveguides, simulations of waveguide modes and photonic structures [4], The central 
role is played there by a beam-propagation method which will be discussed later in the 
hook. 

1.2.1 Methods of computational photonics. Computational electromagnetics 


According to Joannopoulos et al. [5], in a broad sense there are three categories of problems 
in computational photonics: frequency-domain eigensolvers, frequency-domain solvers 
and time-domain simulations. Those problems are extensively discussed in many sources 
including [5], also [6] and [7]. 
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A recent general article by Gallagher [8] gives an introduction to the main algorithms 
used in photonic computer aided design (CAD) modelling along with discussion of their 
strengths and weaknesses. The main algorithms are: 

• BPM - Beam Propagation Method 

• EME - Eigenmode Expansion Methods 

• FDTD - Finite Difference Time Domain. 

The above methods are compared against speed, memory usage, numerical aperture, the 
refractive index contrasts in the device, polarization, losses, reflections and nonlinearity. 
The author’s conclusion is that none of the discussed algorithms are universally perfect for 
all applications. 

We finish this section by indicating that application of computational electromagnetics 
goes well beyond photonics, see the discussion by Jin [6], In fact it has an extremely wide 
range of applications which goes from the analysis of electromagnetic wave scattering 
by various objects, antenna analysis and design, modelling and simulation of microwave 
devices, to numerical analysis of electromagnetic interference and electromagnetic com¬ 
patibility. 


1.2.2 Computational nano-photonics 


This subfield has emerged only in recent years. As the dimensions of photonics devices 
shrink, it plays an increasingly important role. Nanostructures are generally considered 
as having sizes in the order of the wavelength of light. At those wavelengths, multiple 
scattering and near-field effects have a profound influence on the propagation of light and 
light-matter interaction. In turn, this leads to novel regimes for basic research as well as 
novel applications in various disciplines. 

For instance, the modified dispersion relation of photonic crystals and photonic crystal 
fibres leads to novel nonlinear wave propagation effects such as giant soliton shifts and su¬ 
percontinuum generation with applications in telecommunication, metrology and medical 
diagnostics. Owing to the complex nature of wave interference and interaction processes, 
experimental studies rely heavily on theoretical guidance both for the design of such sys¬ 
tems as well as for the interpretation of measurements. In almost all cases, a quantitative 
theoretical description of such systems has to be based on advanced computational tech¬ 
niques that solve the corresponding numerically very large linear, nonlinear or coupled 
partial differential equations. 

University research on photonic crystal structures applied to novel components in optical 
communication, boomed since it has become possible to create structures on the nanometre 
scale. Nanotechnology offers a potential for more efficient integrated optical circuitry at 
lower cost - from passive elements like filters and equalizers to active functions such as 
optical switching, interconnects and even novel lasers. In addition, the technology can 
provide new functionality and reduce overall optical communication cost thanks to smaller 
devices, higher bandwidth and low loss device characteristics that eliminate the need for 
amplification. The ultimate goal is to create three-dimensional photonic structures that may 
lead to progressively optical and one day all-optical computing. 
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A very recent overview on modelling in photonics, including discussion of important 
computational methods, was written by Obayya [9], 

1.2.3 Overview of commercial software for photonics 


An outstanding progress in photonics has been to a great extent possible due to the avail¬ 
ability of reliable software. Some of the main commercial players are listed below: 

1. At the time of writing, Optiwave (www.optiwave.com) provides comprehensive engi¬ 
neering design tools which benefit photonic, bio-photonic and system design engineers 
with a comprehensive design environment. Current Optiwave products include two 
groups: system and amplifier design, and component design. 

For system and amplifier design they offer two packages: 

• OptiSystem - suite for design of amplifier and optical communication systems. 

• OptiSPICE - the first optoelectronic circuit design software. 

For component design, they have the following products: 

• OptiBPM, based on the beam propagation method (BPM), offers design of complex 
optical waveguides which perform guiding, coupling, switching, splitting, multiplex¬ 
ing and demultiplexing of optical signals in photonic devices. 

• OptiFDTD, which is based on the finite-difference time-domain (FDTD) algorithm 
with second-order numerical accuracy and the most advanced boundary condition, 
namely uniaxial perfectly matched layer. Solutions for both electric and magnetic 
fields in temporal and spatial domain are obtained using the full-vector differential 
form of Maxwell’s equations. 

• OptiFiber, which uses numerical mode solvers and other models specialized to fibres 
for calculating dispersion, losses, birefringence and polarization mode dispersion 
(PMD). 

• OptiGrating, which uses the coupled mode theory to model the light and enable 
analysis and synthesis of gratings. 

2. RSoft (www.rsoftdesign.com) product family includes: 

• Component Design Suite to analyse complex photonic devices and components 
through industry-leading computer-aided design, 

• System Simulation to determine the performance of optical telecom and datacom 
links through comprehensive simulation techniques and component models, and 

• Network Modeling for cost-effective deployment of DWDM and SONET technologies 
while designing and optimizing an optical network. 

3. Photon Design (www.photond.com) 

They offer several products for both passive and active component design, such as 
FIMMWAVE and CrystalWave. 

FIMMWAVE is a generic full-vectorial mode finder for waveguide structures. It com¬ 
bines methods based on semi-analytical techniques with other more numerical methods 
such as finite difference or finite element. 

FIMMWAVE comes with a range of user-friendly visual tools for designing 
waveguides, each optimized for a different geometry: rectangular geometries often 
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encountered in epitaxially grown integrated optics, circular geometries for the design 
of fibre waveguides, and more general geometries to cover, e.g. diffused waveguides or 
other unusual structures. 

CrystalWave is a design environment for the layout and design of integrated optics 
components optimized for the design of photonic crystal structures. It is based on both 
FDTD and finite-element frequency-domain (FEFD) simulators and includes a mask file 
generator optimized for planar photonic crystal structures. 

4. CST MICROWAVE STUDIO (www.cst.com/Content/Products/MWS/Overview.aspx) 
is a specialist tool for the 3D electromagnetic simulation of high frequency components. 
It enables the fast and accurate analysis of high frequency devices such as antennas, 
filters, couplers, planar and multi-layer structures. 

There are several web resources devoted to photonics software and numerical modelling 
in photonics. We mention the following: Optical Waveguides: Numerical Modeling Website 
[10], Photonics resources page [11] and Photonics software [12], 


1.3 Optical fibre communication 


In this section we will outline the important role played by photonics in the communication 
over an optical fibre. Other sub-fields will be summarized in due course. 

1.3.1 Short story of optical fibre communication 


Light was used as a mean for communication probably since the origin of our civilization. 
Its modem applications in telecommunication, as developed in the twentieth century, can 
be traced to two main facts: 

• availability of good optical fibre, and 

• compact and reliable light sources. 

Later in this section, we will discuss main developments in some detail. We start with 
a short history of communication with the emphasis on light communication. A recent 
popular history of fibre optics written for a broad audience was published by Hecht [13], 
As an introductory work we recommend simple introduction to fundamentals of digital 
communications by Bateman [14]. 

1.3.2 Short history of communication 


After discovery of electromagnetic waves due to work by Marconi, Tesla and many others, 
the radio was created operating in the 0.5-2 MHz frequency range with a bandwidth of 
15 kHz. With the appearance of television which required bandwidth of about 6 MHz, 
the carrier frequencies moved to around 100 MHz. During the Second World War the 
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Transmitter 


Block diagram of a fibre system. 


Channel 


Receiver 


invention of radar pushed the frequencies to the microwave region (around GHz). Those 
frequencies (range of 2.4-5 GHz) are now actively used by cell phones as well as wireless 
links. 

The big step toward higher frequencies was made possible in the 1960s with the invention 
of laser. The first one operated at the 694 nm wavelength. This corresponds to carrier 
frequency of approximately 5 x 10 14 Hz (5 x 10 5 GHz = 500 THz). Utilization of only 
1% of that frequency represents a signal channel of about 5 THz which can accommodate 
approximately 10 6 analogue video channels, each with 6 MHz bandwidth or 10 9 telephone 
calls at 5 kHz per call. 

A typical communication channel which links two points, point-to-point link (as an 
example, see Fig. 1.1 for a generic fibre optics link), consists of an optical transmitter 
which is usually a semiconductor laser diode, an optical fibre intended to carry light beam 
and a receiver. A semiconductor laser diode can be directly modulated or an external 
modulator is used which produces modulated light beam which propagates in the optical 
fibre. Information is imposed into optical pulses. Then the light is input into the optical 
fibre where it propagates over some distance, and then it is converted into a signal which a 
human can understand. 

All of those elements should operate efficiently at the corresponding carrier’s frequencies 
(or wavelengths). In the 1960s only one element was in place: a transmitter (laser). The 
other two elements were nonexistent. 

The next main step was proposed in 1966 by Charles Kao and George Hockham [15] who 
demonstrated the first silica-based optical fibre with sufficiently low enough propagation 
loss to enable its use as a communication medium. Soon, silica-based optical fibre became 
the preferred means of transmission in both long- and short-haul telecommunication net¬ 
works. An all-optical network has the potential for a much higher data rate than combined 
electrical and optical networks and allows simultaneous transmission of multiple signals 
along one optical fibre link. 

Below, we summarize selected developments of the early history of long-distance com¬ 
munication systems (after Hecht [13] and Einarsson [16]): 

• TAT-1 (1956) First transatlantic telephone system contained two separate coaxial cables, 
one for each direction of transmission. The repeaters (based on vacuum tubes) were 
spliced into the cables at spacing of 70 km. Capacity of the transmission was 36 two-way 
voices. 
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• TAT-6 (1976) Capacity of 4200 two-way telephone channels. Repeaters (based on tran¬ 
sistors) were separated at a distance of 9.4 km. 

• TAT-7 (1983) System similar to TAT-6. 

• TAT-8 (1988) First intercontinental optical fibre system between USA and Europe. 

• HAW-4 (1988) System installed between USA and Hawaii; similar to TAT-8. 

• TCP-3 Extension of HAW-4 to Japan and Guam. 

• TAT-12 (1996) First transatlantic system in service with optical amplifiers. 

A typical optical fibre used in TAT-8 system supported single-mode transmission [16]. 
It had outer diameter of 125 pm and core diameter of 8.3 pm. It operated at 1.3 pm 
wavelength. Digital information was transmitted at 295.6 Mbits. The repeaters which 
regenerated optical signals were spaced 46 km apart. 

Some of the main developments that took place in relation to development of fibre 
communication are summarized below (compiled using information from Ref. [13]): 

• October 1956: L. E. Curtiss and C. W. Peters described plastic-clad fibre at Optical 
Society of America meeting in Lake Placid, New York. 

• Autumn 1965: C. K. Kao concludes that the fundamental limit on glass transparency is 
below 20 dB km -1 which would be practical for communication. 

• Summer 1970: R. D. Maurer, D. Keck and P. Schultz from Coming Glass Works, USA 
make a single-mode fibre with loss of 16 dB km -1 at 633 nm by doping titanium into 
fibre core. 

• April 22, 1977: General Telephone and Electronics. First live telephone traffic through 
fibre optics, at 6 Mbits -1 in Long Beach, CA. 

• 1981: British Telecom transmits 140 Mbits s -1 through 49 km of single-mode fibre at 
1.3 pm wavelength. 

• Late 1981: Canada begins trial of fibre optics to homes in Elie, Manitoba. 

• 1988: L. Mollenauer of Bell Labs demonstrates soliton transmission through 4000 km of 
single-mode fibre. 

• February 1991: M. Nakazawa, NTT Japan sends soliton signals through a million km of 
fibre. 

• 1994: World Wide Web grows from 500 to 10 000 servers. 

• 1996: Introduction of a commercial wavelength-division multiplexing (WDM) system. 

• July 2000: Peak of telecom bubble. JDS Uniphase announces plans to merge with SDL 
Inc. in stock deal valued at $41 billion. 

• 2001: NEC Corporation and Alcatel transmitted 10 terabits per second through a single 
fibre [17]. 

• December 2001: Failure of TAT-8 submarine cable. 

The capacity of optical fibre communication systems grew very fast. Comparison with 
Moore’s law for computers which states that computer power doubles every 18 months, 
indicates that fibre capacity grows faster [18], In 1980, a typical fibre could carry about 
45 Mb s -1 ; in 2002 a single fibre was able to transmit more than 3.5 Tb s -1 of data. Over 
those 22 years the computational power of computers has increased 26 000 times whereas 
the fibre capacity increased 110 000 times over the same period of time. 
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Historical reduction of loss as a function of time from ancient times to the present. Copyright 1989 IEEE. Reprinted, 
with permission from S. R. Nagel, IEEE Communications Magazine, 25,33 (1987). 


1.3.3 Development of optical fibre 


Glass is the principal material used to fabricate optical fibre. To be useful as a transmission 
medium it has to be extremely clean and uniform. In Fig. 1.2 we illustrated the history of 
reduction losses in glass as a function of time (after [19]). To make practical use of glass as 
a medium used to send light, an important element, and in fact the very first challenge was 
to develop glass so pure that 1% of light entering at one end would be retained at the other 
end of a 1 km-thick block of glass. In terms of attenuation this 1% corresponds to about 20 
decibels per kilometre (20 dB km -1 ). 

In 1966 Kao and Hockhman [15] suggested that the high loss in glass was due to 
impurities and was not the intrinsic property of glass. In fact, Kao suggested to use optical 
fibre as a transmission medium. 

Kao’s ideas were made possible in 1970 with the fabrication of low-loss glass fibre by 
the research group from Coming Glass Works [20]. That work established a potential of 
optical communications for providing a high bandwidth and long distance data transmission 
network. Coming Glass Works were the first to produce an optical glass with a transmission 
loss of 20 dB km -1 , which was thought to be the threshold value to permit efficient optical 
communication. 
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Before the Corning work, around 1966, the power loss for the best available glass was 
about 1000 dB km -1 which implies a 50% power loss in a distance of about 3 m [21], Today, 
most optical fibres in use are so-called single-mode fibre able to maintain single mode of 
operation (see Chapter 5 for the relevant definitions). Commercially available fibres have 
losses of about 0.2 dB km -1 at the wavelength of 1550 nm and are capable of transmitting 
data at 2-10 Gbit s -1 . 

The measured spectral loss of a typical low-loss optical fibre is shown in Fig. 1.3 [22], 
The measured fibre had 9.4 pm core diameter and a 0.0028 refractive index difference 
between the core and cladding. Peaks were found at about 0.94, 1.24 and 1.39 pm. They 
were caused by OH vibrational absorption. The rise of the loss from 1.7 pm is mainly 
attributed to the intrinsic infrared absorption of the glass. 

The locations of the first, second and third communications windows are also shown. The 
most common wavelengths used for optical communication span between 0.83-1.55 pm. 
Early technologies used the 0.8-0.9 pm wavelength band (referred to as the first window) 
mostly because optical sources and photodetectors were available at these wavelengths. 
Second telecommunication window, centered at around 1.3 pm, corresponds to zero dis¬ 
persion of fibre. Dispersion is a consequence of the velocity propagation being different for 
different wavelengths of light. As a result, when pulse travels through a dispersive media it 
tends to spread out in time, the effect which ultimately limits speed of digital transmission. 
The third window, centered at around 1.55 pm, corresponds to the lowest loss. At that 
wavelength, the glass losses approach minimum of about 0.15 dB km -1 . 

For the sake of comparison, the loss of about 0.2 dB km -1 corresponds to 50% power 
loss after propagation distance of about 15 km. 


1.3.4 Comparison with electrical transmission 


Historically, copper was a traditional medium used to transmit information by electri¬ 
cal means. Its data handling capacity is, however, limited. Only around 1970, after the 
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fabrication of relatively low-loss optical fibre, it started to be used in an increasing scale. 
From practical perspective, the choice between copper or glass is based on several factors. 

In general, optical fibre is chosen for systems which are used for longer distances and 
higher bandwidths, whereas electrical transmission is preferred in short distances and 
relatively low bandwidth applications. The main advantages of electrical transmission are 
associated with: 

• relatively low cost of transmitters and receivers 

• ability to carry electrical signals and also electrical power 

• for not too large quantities, relatively low cost of materials 

• simplicity of connecting electrical wires. 

However, it has to be realized that optical fibre is much lighter than copper. For example, 
700 km of telecommunication copper cable weighs 20 tonnes, whereas optical fibre cable 
of the same length weighs only around 7 kg [23]. 

The main advantages of optical fibre are: 

• fibre cables experience effectively no crosstalk 

• they are immune to electromagnetic interference 

• lighter weight 

• no electromagnetical radiation 

• small cable size. 


1.3.5 Governing standards 


In order for various manufacturers to be able to develop components that function com¬ 
patibly in fibre optic communication systems, a number of standards have been developed 
and published by the International Telecommunications Union (ITU) [24], Other standards, 
produced by a variety of standards organizations, specify performance criteria for fibre, 
transmitters and receivers to be used together in conforming systems. 

These and related standards are covered in a recent publication [25], It is the work 
of over 25 world-renowned contributors and editors. It provides a reference to over a 
hundred standards and industry technical specifications for optical networks at all levels: 
from components to networking systems through global networks, as well as coverage of 
networks management and services. 

The ITU has specified six transmission bands for fibre optic transmission [18]: 

1. the O-band (original band) - 1260-1310 nm, 

2. the E-band (extended band) - 1360-1460 nm, 

3. the S-band (short band) - 1460-1530 nm, 

4. the C-band (conventional band) - 1530-1565 nm, 

5. the L-band (long band) - 1565-1625 nm, 

6. the U-band (ultra band) - 1625-1675 nm. 

A seventh band, not defined by the ITU, runs around 850 nm and is used in private networks. 
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Illustration of TDM and WDM formats. 



WDM transmission system. 


1.3.6 Wavelength division multiplexing (WDM) 


Wavelength division multiplexing (WDM) is a technique which allows optical signals with 
different wavelengths to propagate in a single optical fibre without affecting one another. 
The technique was invented to increase the information-carrying capacity of a fibre. WDM 
is not the only possible way to increase fibre capacity. In Fig. 1.4 we illustrate two basic 
formats, each allowing an increase of fibre capacity, time-division multiplexing (TDM) and 
wavelength-division multiplexing (WDM). In both formats messages from various channels 
are combined (multiplexed) onto a single information channel. This can be done in time or 
at various frequencies (wavelengths). 

There is extensive literature on the principles, concepts and components for WDM, 
see [26], [27], [28], [29], [30] to just name a few. The WDM transmission system is 
shown in Fig. 1.5. Here, MUX denotes a multiplexer. Its role is to combine light from 
several sources, each operating at a particular wavelength carrying one channel, to the 
transmission fibre. At the receiving end, the demultiplexer (DMUX) separates different 
wavelengths. 
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1.3.7 Solitons 


In the mid 1980s researchers started experimenting with solitons. The soliton phenomenon 
was first described by John Scott Russell, who in 1834 observed a solitary wave in a narrow 
water channel (the Union Canal in Scotland). 

Solitons are pulses of special shape which travel without changing it. Pulse broadening is 
due to the fact that some wavelengths travel faster than other wavelengths, the effect known 
as dispersion. In fibre, there also exists nonlinearity. When taken together, dispersion and 
nonlinearity, they compensate for each other, thus producing conditions where pulse shape 
remains unchanged. To form a soliton, the initial pulse must have a particular peak energy 
and pulse shape. 

In 1973, Akira Hasegawa of AT&T Bell Labs was the first to suggest that solitons could 
exist in optical fibre. In 1988, Linn Mollenauer and his team transmitted soliton pulses over 
4000 km [31]. A recent overview of soliton-based optical communications was provided 
by Hasegawa [32] and by Mollenauer and Gordon [33], 


1.4 Biological and medical photonics 


A special issue of the IEEE Journal of Selected Topics in Quantum Electronics [34] 
summarizes hot topics in bio-photonics until circa 2005. Some of them are directly 
related to computational bio-photonics, such as elastic light scattering properties of 
tissues [35]. 

Two special issues of IEEE Journal of Selected Topics in Quantum Electronics on bio¬ 
photonics [36], [37] summarize recent progress and trends. Biological applications of 
stimulated parametric emission microscopy and stimulated Raman scattering microscopy 
are reviewed by Kajiyama et al. [38], As an introduction to the subject we recommend the 
book by Prasad [39], 


1.5 Photonic sensors 


Photonic sensors have been the subject of intensive research over the last two decades. They 
are emerging as an important branch of photonics. Their development is based on analysing 
different components of the optical signal, like intensity, polarization, pulse shape or arrival 
time and also other phenomena like interference. Utilizing those various possibilities leads 
to different types of sensors. There are almost endless possibilities for their use. The best 
known applications are in civil and military environments for detection of a wide variety 
of biological, chemical and nuclear agents. 

An important medium for sensing is the optical fibre and its recent variations which 
found their own ways of applications in sensing. These include: photonic band gap 
fibres (PBG), microstructure optical fibres (MOF), random hole optical fibres (RHOF) 
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and hybrid ordered random hole optical fibres (HORHOF). They offer higher resolu¬ 
tion, lower cost and/or expanded detection range capability for sources and detection 
schemes. 

In this book we will not explicitly concentrate on analysis and/or simulations of photonic 
sensors, although some of the discussed properties of photonic devices can be directly 
applied in sensors. 

As an example, the use of light emitting diodes (LED) in sensing has been summarized re¬ 
cently by O ’ Toole and Diamond [40] . The authors described development and advancement 
of LED-based chemical sensors and sensing devices. 

Another example is device based on a high-finesse, whispering-gallery-mode disk res¬ 
onator that can be used for the detection of biological pathogens [41], The device operates 
by means of monitoring the change in transfer characteristics of the disk resonator when 
biological materials fall onto its active area. 

The open access journal Sensors (ISSN 1424-8220; CODEN: SENSC9) [42] on science 
and technology of sensors and biosensors is published monthly online. A special issue: 
Photonic Sensors for Chemical, Biological, and Nuclear Agent Detection was published 
recently. It can be considered a good source of current developments and applications of 
new physical phenomena in sensing. 


1.6 Silicon photonics 


The application of silicon in photonics certainly needs a separate discussion. Silicon is a 
material of choice in electronics and it is also economical to build photonic devices on 
this material [43], see also the contribution by Soref [44] for an overview of optoelectronic 
integrated circuits. However, silicon is not a direct-gap material and therefore cannot 
efficiently generate light. Intensive work on those issues is taking place in many institutions, 
see the recent summary of the Intel-UC Santa Barbara collaboration on the hybrid silicon 
laser [45], 

On another front, associated with computer technology, the continuation of Moore’s Law 
and progress which it reflects is becoming increasingly dependent on ultra-fast data transfer 
between and within microprocessor. The existing electrical interconnects are expected to 
be replaced by high speed optical interconnects. They are seen as a promising way forward, 
and silicon photonics is seen as particularly useful, due to the ability to integrate electronic 
and optical components on the same silicon chip. 

Particularly significant progress has been seen in the advances of optical modulators 
based on silicon, see the recent report by a leading collaboration [46]. 

In designing and optimizing those devices simulations play a very important role. Re¬ 
cent advances in modelling and simulation of silicon photonic devices are summarized 
by Passaro and De Leonardis [47], They review recently developed simulation methods 
for submicrometre innovative Silicon-on-Insulator (SOI) guiding structures and photonic 
devices. 
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1.7 Photonic quantum information science 


A new emerging direction within photonics is related to quantum communications and 
quantum information processing. Recent progress in this subfield is summarized by Politi 
et al. [48]. Here, we briefly outline recent main developments. 

Photons potentially can play the crucial role in the quantum information processing due 
to their low-noise properties and ease of manipulation at the single qubit level. In recent 
years several quantum gates have been implemented using integrated photonics circuits. 

1. A theoretical proposal for the implementation of the quantum Hadamard gate using 
photonic crystal structures [49], An integrated optics Y-junction beam splitter has been 
proposed for realization of this gate. Reported numerical simulations support theoretical 
concepts. 

2. A group from Bristol demonstrated the capability to build photonic quantum circuits 
in miniature waveguide devices [50] and optical fibres [5 1], They reported recently on high- 
fidelity silica-on-silicon integrated optical realizations of key quantum photonic circuits, 
including two-photon quantum interference and a controlled-NOT logic gate [52], They 
have also demonstrated controlled manipulation of up to four photons on-chip, including 
high-fidelity single qubit operations, using a lithographically patterned resistive phase 
shifter [53], 
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2 Basic facts about optics 

L._ 


In this chapter, we review basic facts about geometrical optics. Geometrical optics is 
based on the concept of rays which represent optical effects geometrically. For an excellent 
introduction to this topic consult work by Pedrotti and Pedrotti [1], 


2.1 Geometrical optics 


2.1.1 Ray theory and applications 


We visualize propagation of light by introducing the concept of rays. These rays obey 
several rules: 

• In a medium they travel with velocity v given by 



where c = 3 x 10 s m/s is the velocity of light in a vacuum and the quantity n is known 
as refractive index. (We introduced here the notation n for refractive index instead of 
widely accepted n in the optics literature to avoid confusion with electron density, n 
which will be used in later chapters.) Some typical values of refractive indices are shown 
in Table 2.1. 

• In a uniform medium rays travel in a straight path. 

• At the plane of interface between two media, a ray is reflected at the angle equal to the 
angle of incidence, see Fig. 2.1 . At the plane interface, the direction of transmitted ray is 
determined by Snell’s law 

«i sin 0| = « 2 sin @ 2 (2.2) 

where n\ and «2 are the values of refractive indices in both media and 9\ and 62 are the 
angles of incidence and refraction, respectively. 

The general relation between incoming ray A and the reflected ray B is 

B = R - A (2.3) 

with R reflection coefficient (complex). Reflection coefficients for two main types of waves 
which can propagate in planar waveguides are given by Fresnel formulas and will be 
determined in the next chapter. 
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Illustration of critical angle. 


2.1.2 Critical angle 


From Snell’s law one can deduce the conditions under which light ray will stay in the 
same medium (e.g. dense) after reflection, see Fig. 2.2. The situation shown corresponds 
to O2 = ix jl. Angle 0\ then becomes known as the critical angle, 9 C and its value is: 


m 


(2.4) 










Geometrical optics 




2.1.3 Lenses 


We discuss here only thin lenses, defined when the vertical translation of a ray passing 
through lens is negligible. The ray enters and leaves the lens at approximately the same 
distance from the lens axis. 

An important use of lenses in fibre optics is to focus light onto a fibre, see Fig. 2.3. Here, 
a collimated beam of light (parallel to the lens axis) is focused to a point. The point is 
known as the focal point and is located at a distance /, known as focal length from the lens. 
All the collimated rays converge to the focal point. 

Focal distance / is related to the curvatures R\ and R2 of the spheres forming lens as 




(2.5) 


If beam of light travels at some angle relative to the lens axis, Fig. 2.4, the rays will be 
focused in the focal plane. 
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A cylindrical disk with graded refractive index. 


2.1.4 GRIN systems 


GRIN systems whose name originates from GRadient in the INdex of refraction play the 
role similar to lenses. Here, the refractive index of the material is inhomogeneous. Spatial 
dependence of refractive index serves as yet another parameter in the design of an ideal 
lens. 

The simple analysis of GRIN structure now follows [2], see Fig. 2.5. We have a flat 
cylindrical disk where the refractive index n{r) depends on radius r (distance from the 
axis), keeps its maximum at the optical axis and decreases outwards. In order to focus 
optical rays at the focal point F, the optical paths of all rays must be the same. (Optical path 
is defined as a product of geometrical distance in a given region and the value of refractive 
index in that region.) Optical path difference of the central ray and the one going through 
region with n{r) between planar wave front on the left and spherical wavefront on the right 
should be zero 

«max • d « n(r) ■d + DE (2.6) 


Distance DE is determined as follows: EF = f, where / is the focal length, DF = DE + f 
and DF sjr 2 + f 1 . Combining those relations, we have 

DE = Jr 2 + / 2 - / (2.7) 


Use the expansion 


'?T7wJ 1 + ^/(i + |r) 


Apply the above expansion in (2.7) and obtain 


( 2 . 8 ) 


Finally, the dependence of refractive index is 


(2.9) 
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The above equation tells us that in order to focus parallel light by the GRIN plate, the index 
of refraction should depend parabolically on radius. 


2.2 Wave optics 


Light is an electromagnetic wave and consists of an electric field vector and a perpendicular 
magnetic field vector oscillating at a very high rate (of the order of 10 14 Hz) and travelling 
in space along direction which is perpendicular to both electric and magnetic field vectors. 
(These topics are discussed in more detail in the next chapter.) Here we state that the electric 
field vector E(r,t) obeys the following wave equation: 


V 2 E(r, t) — 


1 3 2 E(r, t) 

? 3 1 2 


( 2 . 11 ) 


The above wave equation is time invariant; that is it does not change under the replacement 
t —» —t. We will soon utilize that fact in establishing the so-called Stokes relations. 

The one-dimensional wave equation in a medium is 


J_ 3 ^ 

v 2 3 t 1 


E(z, t) = 


3 z 2 


E(z, t) 


where v is the velocity of light in the medium. An exact solution can be expressed in terms 
of two waves E + and E~ travelling in the positive and negative z-directions 


E(z, t ) = E + (z - vt) + E~ (z + vt ) 


Dispersion relation for the above wave equation is obtained by substituting E ( z , t ) = 
E 0 sin (cot — kz) into the wave equation, performing differentiation and dividing both sides 
by E(z,t). The obtained dispersion relation which relates k and co is 

co = ±v ■ k 


It shows a linear dependence between angular frequency co of the propagating wave and 
its wavenumber k. Angular frequency co is related to frequency as co = 2n f whereas 
wavenumber k is determined as 



Here X is the wavelength of the light wave and v is the phase velocity of the wave. The 
factor cot — kz is known as the phase of the wave. 


Example When electromagnetic wave propagates in a medium where losses are significant, 
the attenuation must be incorporated into an expression for electric field of the wave. The 
relevant expression is 


E(z,t) = E 0 e “ z sin(<wt — kz) 
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where a is the attenuation coefficient. The intensity of light is proportional to the square 
of its electric field. Therefore, the power of the light beam decreases as exp(-2aZ.) during 
the propagation distance L. If we measure the distance in km, the unit of a is km -1 . Power 
reduction in decibels (dB) is determined as 

dB = 10 log 10 e~ 2aL (2.12) 

Find the relation between the attenuation coefficient and the power change in dB/km. 

Solution 
One has 

Loss(dB) = 101og 10 e~ 2aL 

m —20 • a • Z • loge 
= -8.685 • a L 


Therefore 


Loss(dB)/L(km) = -8.685 • a 


or 


dB/km = -8.685 • a 


where a is in km 1 . 


(2.13) 


2.2.1 Phase velocity 


Consider a plane monochromatic wave propagating along the z-axis in an infinite medium, 
see Fig. 2.6 

E(z,t) = Eqcos (kz — cot) (2.14) 

Select one point, say at the amplitude crest, and analyse its movement. If we assume that 
this point maintains constant phase as 

kz — cot = const 


we can determine velocity of that point. From the above relation 


(2 - i5) 

The quantity v p is known as phase velocity and it represents the velocity at which a surface 
of constant phase moves in the medium. Velocity in Eq. (2.1) should be interpreted as phase 
velocity and Eq. (2.1) be replaced by 


(2.16) 
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2.2.2 Group velocity 


The group velocity describes the speed of propagation of a pulse in a medium. To see 
this fact, consider propagation of two plane waves with different parameters but equal 
amplitudes 

E\ (z, t) = E 0 cos ( k\z — w\t ), E 2 (z, t) = E 0 cos ( k 2 z — co 2 t) (2.17) 

Assume that their frequencies and wavenumbers differ by 2 A co and 2Ak and write 
coi = co+ Aco, co 2 = co — A co 
k\ = k + A k, k 2 = k — A k 

Superposition of those waves results in a wave E (z, t ) which is 
E(z,t) = E\(z, t) + E 2 (z, t) 

m E 0 {cos [(k + A k)z— (w + A oo)t] + cos [(& — A k)z — (co — Aco)f]} 
Using trigonometric identity 

2 cos a cos fi = cos (a + /J) + cos (a — f3) 
allows us to write expression for E(z, t) as 

E ( z, t ) = 2Eq cos ( kz — cot) cos (A kz — A cot) (2.18) 

Eq. (2.18) represents a wave at carrier frequency co that is modulated by a sinusoidal 
envelope at the beat frequency Aco, see Fig. 2.7. Phase velocity of the carrier wave is 
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v p = oo/k. The envelope moves with velocity v g which is 


Aco dco 

A k dk 


(2.19) 


In a free space a> = kc which is known as dispersion relation, the group velocity is 
v g = ^jr = c. In a free space (vacuum), the phase and group velocities are identical. In 
media, the dispersion relation, i.e. co = co(k) dependence is usually a complicated function. 


Example Write a MATLAB program to visualize phase and group velocities based on 
Eqs. 2.15 and 2.19. 

Solution 

The MATLAB code for phase velocity is provided in the Listing 2A.1 and in the 
Listing 2A.2 for group velocity. The results are shown in Fig. 2.6 and Fig. 2.7. 


2.2.3 Stokes relations 


Stokes relations relate reflectivities and transmittivities of light at the plane of interface 
between two media. Assume that electric field E which is represented by the corresponding 
ray interacts with the interface and is partially reflected and partially transmitted, see Fig. 2.8. 
Here r, t are reflection and transmission coefficients for fields (not intensities) propagating 
from medium 2 to medium 1 (here, from top to bottom). Similarly, d, t' denote the reflection 
and transmission coefficients for fields propagating from medium 1 to medium 2. 
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© 


© 



rE 


Transmission and reflection at the interface used in deriving Stokes relations. 


© 


© 



rE 


Transmission and reflection at the interface used in deriving Stokes relations. Time-reversed process. 



| Transmission and reflection at the interface used in deriving Stokes relations. Full processes. 


At the beginning of this section we have established the property of time invariance of 
the wave equation. Time reversal invariance tells us that the above picture (reflection and 
transmission) should also work in reverse time, see Fig. 2.9 where we combined rays tE 
and rE to form ray E. 

However, when one inputs two rays (tE and rE) at the interface between two media 1 and 
2, one from above and one from below, there should exist partial reflection and transmission 
for both rays, as is illustrated in Fig. 2.10. 

A comparison of Fig. 2.9 and Fig. 2.10 reveals physically identical situations, and 
therefore the following relations hold: 

r^E + t'tE = E 
rtE + r'tE = 0 

From the above we obtain Stokes relations 

t't = 1 - r 2 
/ = -r 






Basic facts about optics 



B 

^ Interference in a single film with light incident at arbitrary angle 0,-. 


2.2.4 Interference in dielectric film 


Interference in dielectric film is an important phenomenon behind operation of optical 
devices, like filters. It is also a fundamental effect for the operation of the Fabry-Perot 
interferometer. 

To analyse this interference, consider a single layer dielectric film having refractive 
index «/ which is in contact with the medium having refractive index «o from above, 
see Fig. 2.11. For such a configuration, we will derive conditions for constructive and 
destructive interference. We will do this by calculating the optical path difference. 

The incident beam 1 creates on reflection and transmission two emerging beams 2 and 
3. The phase difference between beams 2 and 3 as measured at points C and D is due to 
optical path difference A (optical path is defined as a product of geometrical path times 
refractive index) and is 


A = n f (AB + BC) - n 0 AD 

From Fig. 2.11 one can determine the following trigonometric relations: 

AD 

(2.20) 

— = cos (90 — Bi) = sin 0, 

AC 

(2.21) 

h 

— = cos Ot 

AB 

AE 

(2.22) 

— = tan 9 t 

h 

and also 

(2.23) 

AC = 2-AE 

Combining Eqs. (2.23) and (2.24), one obtains 

(2.24) 

AC = 2 • t ■ tan 9, 

Substituting (2.25) into (2.21) we obtain geometrical path for AD part 

(2.25) 

AD = 2 ■ h ■ tan 6 t ■ sin 0 t 

Geometrical path AB + AC is obtained using Eq. (2.22) as 

(2.26) 

AB + AC = 2-AB = 2- 

cos 0 t 

(2.27) 
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Combining Eqs. (2.26) and (2.27) with (2.20) one finds the optical path difference as 


t sin Q t 

A = nf ■ 2 ■ --— no - 2 ■ t ■ -— • sin#,- 

cos 9 t cos 9, 

(2.28) 

2 hh f \ ho 1 

= - i- | 1-- sin 9, sm 0, i 

cos 9 t [ nf ) 

(2.29) 


Using Snell’s law 


«o sin 9, = nj sin 0, 


and the following trigonometric identity 

cos 2 9 t + sin 2 0, = 1 
the curly bracket in Eq. (2.29) gives 

1 — ^ sin 0 t sin 9, =. ; 1 — sin 9 t hf sin 9 t 
n f n f 

— 1 — sin 2 9, = cos 2 9 t 


Employing the last result in (2.29), we finally obtain for the optical path difference 


A = 2 ■ h ■ nf ■ cos6 t (2.30) 

Thus, the optical path difference is expressed in terms of the angle of refraction. For normal 
incidence 0,- = 9 t = 0 and A = 2hfh. The phase difference 8 acquired by light ray due to 
travel in the film corresponding to optical path difference A is 

8 = k- A=^-A (2.31) 

A 0 


2.2.5 Multiple interference in a parallel plate 


Based on discussion from the previous section, we will now consider interference of multiple 
beams in a parallel plate, see Fig. 2.12. Here, E 0 is the amplitude of the incoming light, r, t 
are the coefficients of external reflections and transmissions, and /, t' are coefficients of 
internal reflections and transmissions. 
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Phase difference between two successive reflected beams which exists due to travel in 
the film having refractive index «/ is, Eq. (2.31) 


where A is given by Eq. (2.30). Analysing Fig. 2.12, one can write the following relations 
for outgoing beams 


E 0 e lo>t 

Ei m rE 0 e la, ‘ 

E 2 = ttVE 0 e m ‘~ 
E 3 = tty 3 E 0 e m ‘ 
£4 = tt'r' 5 Eoe m> 


incident beam 
first reflected beam 
second reflected beam 
third reflected beam 
fourth reflected beam 


kth reflected beam is 

E k = 

Sum of all reflected beams, Er is 

E r = Y^E k = rE 0 e io>t + Y^tt'r J< ' lk - 3) EQe i0,t e- i 


= E a e wt | r + ttVe~ iS / (2i-4) e -i( *“ 2)S | 


To perform the above infinite sum, use an elementary result for the sum of the geometric 
series 


S = ^V -2 = 1+ X + x 2 + • 


For \x\ < 1, the sum is S — 1/(1 — x). In our case here, x = r ,2 e " 5 , and total reflection is 
thus 

„ • , f tt'Ee-® 1 
Er = E 0 e 

Using Stokes relations, we obtain 

E r = Eoe™ 


1 - r' 2 e- a 

(1 - r 2 )re~‘‘ 


— E 0 e m 


\ 1 - r 2 e~ l 

r( 1 - e~ a ) 


1 - r 2 e~ iS 

Light intensity of the reflected beam is proportional to the irradiance Ir which is 

f g^d ~ e~ iS ) 1 [ e- mt (\ - 1 

1 — r 2 e~ iS j { 1 — r 2 e iS } 

2(1 - cos*) 


Ir ~ l^l 2 = Ey 

= E y 


1 + r 4 — 2 r 2 cos S 
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Introduce /, as the irradiance of the incident beam and It as the irradiance of the transmitted 
beam. We have 


h _ 1 %!* 

Ii ~ ' W 


and 


Ir + It = It 


which originates from energy conservation in nonabsorbing films. Using the above relations, 
one obtains 


2r 2 (l — cos<5) 

1 + r 4 — 2r 2 cos S 1 


(2.32) 


and 


d-r 2 ) 2 7 

1 + r 4 — 2 r 2 cos 5 ' 


(2.33) 


2.2.6 Fabry-Perot(FP) interferometer 


The FP interferometer is formed by two parallel plates with some medium (often air) 
between them. Expressions for transmission and reflection in such a system have been 
derived in the previous section. 

The expression for transmission can be used to determine the transmittance T (or Airy 
function) as follows: 


. It (1 - r 2 ) 2 


Ii 1 + r 4 — 2r 2 cos S 
1 - Ir 2 + r 4 

1 + r 4 - 2 r 2 + 4 r 2 sin 2 f 


1 + rTTSyr sin 2 | 


using trigonometric relation cos 5 = 1 — 2 sin 2 |. 
Define coefficient of finesse 


Transmittance is thus expressed as 


1 + F sin 2 | 


(2.34) 


(2.35) 
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Fabry-Perot fringe profile. 


Example Write a MATLAB program to analyse the transmittance of an FP etalon given by 
Eq. (2.35). 

Solution 

The MATLAB code is shown in Appendix, Listing A 2.3 and the FP fringe profile is shown 
in Fig. 2.13. The figure shows plots of transmission as a function of phase difference <5 for 
various values of external reflection r. 


2.3 Problems 


1. Use Snell’s law to derive an expression for critical angle 6 C . Compute the value of 0 C 
for a water-air interface (n water = 1.33). 

2. Isotropic light source located at distance d under water illuminates circular area of a 
radius 5 m. Determine d. 

3. A tank of water is covered with a 1 cm thick layer of linseed oil (n 0 u = 1.48) above 
which is air. What angle must a beam of light, originating in the tank, make at the 
water-oil interface if no light is to escape? 

4. Analyse the travel of a light ray through a parallel glass plate assuming refractive 
index of glass n. Determine displacement of the outgoing ray. 

5. A point source S is located on the axis of, and 30 cm from a plane-convex thin lens 
which has a radius of 5 cm. The glass lens is immersed in air. Determine the location 
of the image (a) when the flat surface is toward S and (b) when the curved surface is 
toward S. 
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Propagation of light in U-tube. 



6. Analyse multiple interference in a parallel plate FP etalon formed by two parallel 
plates with an active medium inside with gain g [3], Derive an expression for a signal 
gain. 

7. A glass rod of rectangular cross section is bent into U-shape as shown in Fig. 2.14 
[4], A parallel beam of light falls perpendicularly on the flat surface A. Determine 
the minimum value of the ratio R/d for which all light entering the glass through 
surface A will emerge from the glass through surface B. Assume that the refractive 
index of glass is 1.5. 

8. The Sellmeier dispersion equation is an empirical expression for the refractive index 
of a dielectric (like glass) in terms of the wavelength X. One of the most popular 
models is [5], [6] 


(2.36) 


Here G\, G 2 , G) and A;, X 2 , A3 are constants (called Sellmeier coefficients) which 
are determined by fitting the above expression to the experimental data. Some of the 
Sellmeier parameters are given in Table 2.2 (data combined from [5], [7] and [6]). 



Table 2.2 Sellmeier parameters for Si0 2 . j 

Parameter G\ 

Gz G 3 Aj (pm) X 2 (pm) X 3 (pm) 


SiO z 0.696749 0.408218 0.890815 0.0690660 0.115662 9.900559 


Write a MATLAB program to plot refractive index as a function of X from 0.5 pm 
to 1.8 pm for pure silica. 

9. Using the Sellmeier relation, plot reciprocal group velocity and group velocity for 
SiOz- Assume 1.36 < A.[pm] < 1.65. 


Appendix 2A: MATLAB listings 


In Table 2.3 we provided a list of MATLAB files created for Chapter 2 and a short description 
of each function. 
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Table 2.3 List of MATLAB functions for Chapter . j 

Listing 

Function name 

Description 

2A.1 

phase ^velocity, m 

Plots phase velocity 

2A.2 

group-velocity.m 

Plots group velocity 

2A.3 

FPjransmit.m 

Plots transmittance of Fabry-Perot etalon 


Listing 2A.1 Program phase_velocity.m. MATLAB program which plots phase velocity. 

7. File name: phase_velocity.m 
°/, Illustrative plot of phase velocity 
clear all 
N_max = 101; 

t = linspace(0,30d-15,N_max); 

7 . 

c = 3dl4; 
n = 3.4; 
v_p = c/n; 
lambda = 1.0; 
k = 2*pi/lambda; 
frequency = v_p/lambda; 
z = 0.6; 

omega = 2*pi*frequency; 

A = sin(k*z - omega*t); 
plot(t,A,’LineWidth’,1.5) 
xlabelC’Time’,’FontSize’,14); 
ylabel(’Amplitude’,’FontSize’,14); 

set (gca,’FontSize’, 14) ; 7o size of tick marks on both axes 

axis( [0 30d-15 -1.5 2]) 
text(17d-15, 1.3, ’z(t)’,’Fontsize’,16) 

line([1.53d-14,2d-14],[1,1],’LineWidth’,3.0) 7, drawing arrow 

line([1.9d-14,2d-14],[1.1,1],’LineWidth’,3.0) 
line([1.9d-14,2d-14],[0.9,1],’LineWidth’,3.0) 

close all 


7. number of points for plot 
7. creation of theta arguments 

7. velocity of light in microns/s 
7. refractive index 
7. phase velocity 
7. microns 


7. distance in microns 


Listing 2A.2 Function group_velocity.m. MATLAB program which plots group velocity. 
7. File name: group_velocity.m 

7. Illustrative plot of group velocity by superposition 
7. of two plane waves 
clear all 

N_max = 300; ’/, number of points for plot 
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t = linspace(0,300d-15,N_max); 

7 . 

c = 3dl4; 


7, velocity of light in microns/s 
7, refractive index 
7. phase velocity 
7, microns 


7. creation of theta arguments 


= 3.4; 


v_p = c/n; 
lambda = 1.0; 
frequency = v_p/lambda; 
z = 0.6; 


7. distance in microns 


omega = 2*pi*frequency; 
k = 2*pi/lambda; 

Delta_omega = omega/15.0; 

Delta_k = k/15.0; 

7. 

omega_l = omega + Delta_omega; 
omega_2 = omega - Delta_omega; 
k_l = k + Delta_k; 
k_2 = k - Delta_k; 

A = 2*cos(k*z - omega*t).*cos(Delta_k*z - Delta_omega*t); 
plot(t,A,’LineWidth’,1.3) 
xlabel(’Time’ , ’FontSize’,14) 
ylabel(’Amplitude’,’FontSize’,14) 

set(gca,’FontSize’,14); % size of tick marks on both axes 

close all 

Listing 2 A.3 Function FP transmit.m. Program plots transmittance of Fabry-Perot etalon. 
7, FP_transmit .m 

l Plot of transmittance of FP etalon 
clear all 

N_max = 401; l number of points for plot 

t = linspace(0,1 ,N_max) ; 7« creation of theta arguments 

delta = (5*pi)*t; angles in radians 


7. 

hold on 

for r = [0.2 0.4 0.8] 


7. reflection coefficient 


F = 4*r~2/ (l-r~2) . ~2; 7. coefficient of finesse 

T = l./(l + F*(sin(delta/2)).~2); 
plot(delta,T,’LineWidth’,1.5) 

end 

7. 

I Redefine figure properties 

ylabelC’Transmittance’,’FontSize’,14) 

xlabel(’Phase difference (rad)’,’FontSize’,14) 
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set(gca, ’FontSize’ ,14) ; °/ 0 size of tick marks on both axes 

text(6.1, 0.05, ’2 \pi’,’Fontsize’,14) 

text(12.2, 0.05, ’4 \pi’,’Fontsize’,14) 

text(8.5, 0.1, *r = 0.8’,’Fontsize’,14) 

text(8.5, 0.5, ’r = 0.4’,’Fontsize’,14) 

text(8.5, 0.8, ’r = 0.2’,’Fontsize’,14) 

% 

pause 
close all 


References 


[1] F. L. Pedrotti and L. S. Pedrotti. Introduction to Optics. Third Edition. Prentice Hall, 
Upper Saddle River, NJ, 2007. 

[2] E. Hecht. Optics. Fourth Edition. Addison-Wesley, San Francisco, 2002. 

[3] Y. Yamamoto. Characteristics of AlGaAs Fabry-Perot cavity type laser amplifiers. IEEE 
J. Quantum Electron., 16:1047-52, 1980. 

[4] L. Yung-Kuo, ed. Problems and Solutions on Optics. World Scientific, Singapore, 1991. 

[5] J. W. Fleming. Dispersion in Ge0 2 — Si0 2 glasses. Applied Optics, 23:4486-93,1984. 

[6] S. O. Kasap. Optoelectronics and Photonics: Principles and Practices. Prentice Hall, 
Upper Saddle River, NJ, 2001. 

[7] G. Ghosh, M. Endo, and T. Iwasaki. Temperature-dependent Sellmeier coefficients and 
chromatic dispersions for some optical fiber glasses. J. Lightwave Technol, 12:1338- 
42, 1994. 




3 Basic facts from electromagnetism 

__ 


In this chapter we review foundations of electromagnetic theory and basic properties of 
light. 


3.1 Maxwell's equations 


Our discussion is based on Maxwell’s equations which in differential form are [1], [2] 


9B 

V x E =- 

dt 

(3.1) 

9D 

VxH=J+— 

9f 

(3.2) 

V • D = p v 

(3.3) 

V • B = 0 

(3.4) 

where 


E is electric field intensity [ V/m\ 

B is magnetic flux density [T] 

H is magnetic field intensity [A/rri\ 

D is electric flux density [C/m 2 ] 

J is electric current density [A/m 2 ] 
p v is volume charge density [C/m 3 ]. 

The operator V in Cartesian coordinates is 


vJl.A.Al 

|_9x dy dzj 

(3.5) 

The above relations are supplemented with constitutive relations 


D = eE 

(3.6) 

B = pH 

(3.7) 

J = crE 

(3.8) 


where e = £o£ r is the dielectric permittivity [F/m], p = poP r is permeability [H/m], a is 
electric conductivity, s r is the relative dielectric constant. For optical problems considered 
here, p r = 1. 
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First, we recall two mathematical theorems, Gauss’s theorem 

j£ F-ds= J V ■ Fdv (3.9) 

where S is the closed surface area defining volume V, and Stokes’s theorem 

f V x F • ds (3.10) 


i ¥d, =b 


where contour L defines surface area A. With the help of the above theorems, Maxwell’s 
equations in differential form can be transformed into an integral form. Integral forms of 
Maxwell’s equations are 


<Z) D • ds = / p v dv 

(3.11) 

Js Jv 


(j) B • ds = 0 

(3.12) 

Js 



(3.13) 


(3.14) 


Properties of the medium are mostly determined by s, fi and a . Further, for the dielectric 
medium the main role is played by s. The medium is known as linear if e is independent 
of E; otherwise it is nonlinear. If it does not depend on position in space, the medium is 
said to be homogeneous; otherwise it is inhomogeneous. If properties are independent of 
direction, the medium is isotropic; otherwise it is anisotropic. 


3.2 Boundary conditions 


Boundary conditions are derived from an integral form of Maxwell’s equations. For that 
purpose we separate all vectors into two components, one parallel to the interface and one 
normal to the interface. The derivation of boundary conditions is facilitated by using the 
contour and cylindrical shapes as shown in Fig. 3.1. It will be done independently for the 
electric and magnetic fields. 

3.2.1 Electric boundary conditions 


First, analyse transversal components. Integrate Eq. (3.13) over closed loop C, see Fig. 3.1 
and then set A h ^ 0 

J E • dl = —Ei • d\ + E2 ■ d\ = —E\ t Aw + E 2 tA.uu = 0 

ABCDA 






Boundary conditions 



different dielectrics are shown. 


Therefore 


E\ t = $2t 


(3.15) 


The tangential component of the electric field is therefore continuous across the boundary 
between any two dielectric media. Using general relation (3.6) one obtains the boundary 
condition for tangential component of vector D 


Du 

S r l 


d 2 . 


(3.16) 


In order to obtain conditions for normal components, consider the cylinder shown in Fig. 3.1 . 
Here, ni and n 2 are normal vectors pointing outwards of the top and bottom surfaces into 
corresponding dielectrics. Apply Gauss’s law with integration over surface S of the cylinder 


/ 


D-c/s 




D 2 • n 2 ds = p s As 


The contribution from the outer surface of the cylinder vanishes in the limit Ah —> 0. Use 
the fact that n 2 = — n! and have 

m • (Dj - D 2 ) = p s (3.17) 

or 

s rl E 2 „ — s r2 E\ n = p s (3.18) 

when using general relation (3.6). Normal component of vector D is not continuous across 
boundary (unless p s = 0). 


3.2.2 Magnetic boundary conditions 


When deriving boundary conditions for magnetic fields, we use approach similar as for 
electric fields. We use Eq. (3.12) where integration is over the cylinder 

I Bds = 0 






Basic facts from electromagnetism 


Table 3.1 Summary of boundary conditions for electric and magnetic fields. ; 

Field components 

General form 

Specific form 

Tangential E 

n 2 x (Ei - E 2 ) = 0 

E u = E 2t 

Normal D 

n 2 ■ (D, - D 2 ) = p s 

Di, - D 2t = p s 

Tangential H 

n 2 x (H, - H 2 ) = J s 

H 2 , = H u 

Normal B 

n 2 ■ (B, - B 2 ) = 0 

Bm = B 2 „ 


or 

Bu=B 2n (3.19) 

Using general relation (3.7), one obtains condition for magnetic field H 

HrxHu = n r2 H 2n (3.20) 

The above relations tell us that normal component of B is continuous across the boundary. 

To obtain how transversal magnetic components behave across interface, apply Ampere’s 
law for contour C and then let Ah —»■ 0. One obtains 

J H d\ = J H 2 -dl - J Hi d\ = / 

c 

Here / is the net current crossing the surface of the loop. As we let Ah approach zero, the 
surface of the loop approaches a thin line of length Aw. Hence, the total current flowing 
through this thin line is / = J s ■ Aw, where J s is the magnitude of the normal component of 
the surface current density traversing the loop. We can therefore express the above equation 
as 

H 2t -H u =J s (3.21) 

Utilizing emit vector n 2 , the above relation can be written as 

n 2 x (Hi - H 2 ) = J (3.22) 

where n 2 is the normal vector pointing away from medium 2, see Fig. 3.1. J s is the surface 
current. 

In Table 3.1 we summarize boundary conditions between two dielectrics for the electric 
and magnetic fields. The behaviour of various field components is shown schematically in 
Fig. 3.2. 


3.3 Wave equation 


Here, we will derive the wave equation for a source-free medium where Q v = 0 and J = 0. 
The wave equation is obtained by applying Vx operation to both sides of Eq. (3.1). One 







Time-harmonic fields 


Boundary conditions for fields. 


obtains 


V 


V XE=-A(V 


d d dD 

B) = VxH) = -^ = ^ 


9 2 P 

w 


where we have used Maxwell Eq. (3.2) and relation (3.6). Next apply the following mathe¬ 
matical formula 


Vx VxE= V(V-E) —V 2 E (3.23) 

With the help of Maxwell Eq. (3.3), one finds finally 

V 2 E- M e^E (3.24) 

which is the desired wave equation. The quantity /xe is related to velocity of light in a 
vacuum as (assuming ii r = 1) 

lie = ii 0 e 0 e r = ” (3.25) 

c 1 

where n is the refractive index of the medium. 


3.4 Time-harmonic fields 


In many practical situations fields have sinusoidal time dependence and are known as 
time-harmonic. This fact is expressed as 

E(r, t) = i?e{E (r) e 7 ®'} (3.26) 

where E (r) is the phasor form of E (r, t ) and is in general complex. /?e (...) indicates 
‘taking the real part in’ quantity in brackets. Finally, co is the angular frequency in [rad/s]. 
In what follows, all fields will be represented in phasor notation. 
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Applying the time-harmonic assumption (3.26) to source-free Maxwell’s equations re¬ 
sults in 


V x E = —y'w/rH 
V x H = jcosE 


Applying the time-harmonic assumption again to wave 
V 2 E + k 2 E = 0 

where k = co^/JCs. Explicitly, the above wave equation: 

3 2 a 2 a 2 , A 
a? + 37 + i? + k j 


equation (3.24) gives 


is 

E i = 0 


with i = x, y, z. 


(3.27) 

(3.28) 


(3.29) 


(3.30) 


Example As an example, consider propagation of a uniform plane wave characterized by 
uniform electric field with nonzero component E x . Assume also 


d 2 E x 

~dxF 


= 0 , 


dy 2 


(3.31) 


The wave equation reduces to 


(3.32) 


and has the following forward propagating solution: 

E x (z) = E 0 e ! ' kz (3.33) 


Magnetic field is determined from the Maxwell’s Equation (3.27) 


= 0 0 | = -jcopi (a x H x + a y H y + a z H z ) (3.34) 

E x (z) 0 0 


Here a x , a v , a- are emit vectors along x, y, z axes, respectively. From the above equation, one 
finds 


H x = 0 

H = 1 dE x (z) 

jeofi a z 

H z = 0 


(3.35) 






Time-harmonic fields 



Using the solution for E x (z), one finally obtains the expression for magnetic field 

H = a yHy(z) (3.36) 

= — - — (—jkE x )a v (3.37) 

-JCOfl 

= a y —E x (z) 
co/x 

=«4 

where the impedance Z of the medium is defined as Z = Je. The propagating wave is 
shown in Fig. 3.3. 

We finish this section by providing a useful relation between electric and magnetic fields. 
Assuming time-harmonic and plane-wave dependence of both fields as 

E — expO'cuf - rk • r) 

and using constitutive relation (3.7), Maxwell Eq. (3.1) takes the form 
k x E = w/r-oH 

Introducing emit vector k along wave vector k and the expression for wave number k — 
am *Jfio£o, one finds 

six E = Z 0 H 


where Zq is the impedance of the free-space. 
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3.5 Polarized waves 


Here we will discuss the concept of polarization of electromagnetic waves. Polarization 
characterizes the curve which E vector makes (in the plane orthogonal to the direction 
of propagation) at a given point in space as a function of time. In the most general case, 
the curve produced is an ellipse and, accordingly, the wave is called elliptically polarized. 
Under certain conditions, the ellipse may be reduced to a circle or a segment of a straight 
line. In those cases it is said that the wave’s polarization is circular or linear, respectively. 
Since the magnetic field vector is related to the electric field vector, it does not need separate 
discussion. First, consider a single electromagnetic wave. 

3.5.1 Linearly polarized waves 


Consider an electromagnetic wave characterized by electric field vector E directed along 
the x-axis 

E = & x Eq cos (cot — kz+ (j>) (3.38) 

It is known as a linearly polarized plane wave with the electric field vector oscillating in 
the x direction. Wave propagates in the +z direction. In Eq. (3.38) a> = 2nv is the angular 
frequency and k is the propagation constant defined as 

k= - 

where v = c/n is the velocity of the electromagnetic wave in the medium having refractive 
index n. 4> is known as a phase of electromagnetic wave. 

The electromagnetic wave can also be written in the complex representation as 

E = a x E 0 j ( " ,t - ks+4,} (3.39) 

The actual field as described by Eq. (3.38) is obtained from (3.39) by taking real part. A 
more general expression for the electromagnetic wave is 

E = eE' 0 e i(a,( “ k - r+ ^ ) 

which is known as the plane polarized wave. Here unit vector 'e lies in the plane known as 
plane of polarization. It is perpendicular to vector k which describes direction of propagation 

k -'e — 0 


3.5.2 Circularly and elliptically polarized waves 


In general, when we have an arbitrary number of plane waves propagating in the same 
direction they add up to a complicated wave. In the simplest case, one has only two such 
plane waves. To be more specific, consider two plane waves oscillating along orthogonal 








Polarized waves 


directions. They are linearly polarized having the same frequencies and propagating in the 
same direction 


Ei = E x a x = 2 l x Eq x cos {cot — kz) (3.40) 

E 2 = EySy = &yEoy cos {cot — kz + 4>) (3.41) 

We want to know the type of the resulting wave and the curve traced by the tip of the total 
electric vector E = Ei + E 2 

E = Ei + E 2 = E 0 {cos {cot — kz) + cos {cot — kz+ cf>)} 


First, eliminate cos {cot — kz) term. From Eq. (3.40) 

E x 

cos {cot — kz) = — (3.42) 

Eox 

Use trigonometric identity 

cos (a - ft) = cosacos/f + sina sin/J 


to express Eq. (3.41) as follows: 

E y = Eoy | cos {cot — kz) cos 8 + jl. — cos 2 {cot — &z)] 1/2 sin <5 J 
Substitute Eq. (3.42) in the above and have 


Ey 


Squaring both sides gives 


E x ( El\ V2 

_ cos0 + ^__j sin * 


( Ey E x 

I TT- TT~ C °S <P 

\^0y &0x 


]£2 j? j? 2 

-|--2cos<ft y ■■_*■ + -f-cos 2 cj) + sin 2 cf> = sin 2 0 
E ly E 0y E 0x El El 


Finally, the above equation gives 

(%) + (S “ 2 (l;)(^) cos0=sinV 


(3.43) 


This is general equation of an ellipse. Thus the endpoint of E(z, t ) will trace an ellipse at 
given point in space. It is said that the wave is elliptically polarized. 

When phase cj> — f, the resultant total electric field is 


(IHJ 


which describes right elliptically polarized wave since as time increases the end of elec¬ 
tric vector E rotates clockwise on the circumference of an ellipse. Typical situations are 
illustrated in Fig. 3.4 where we show elliptic, circular and linear polarizations. 
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3.6 Fresnel coefficients and phases 


In this section we discuss electromagnetic wave undergoing reflection at the boundary 
between two dielectrics, see Fig. 3.5. The plane of incidence is defined as the plane formed 
by unit vector n normal to the interface between two media and the directions of propagation 
of the incident and reflected waves. 

We will derive so-called Fresnel coefficients [3] which are reflection and transmission 
coefficients expressed in terms of the angle of incidence and material properties (e dielectric 
constants) of the two dielectrics. Further, Fresnel phases are determined from Fresnel 
coefficients using the following definition [1]: 

r = e~ 2j * (3.44) 

Both reflection coefficients r and phases 0 will be calculated for both types of modes, TE 
and TM. 


3.6.1 TE polarization 


Referring to Fig. 3.6 the E u , E\ r , E 2t are complex values of incident, reflected and transmit¬ 
ted electric fields in medium 1 and 2. The incident electric field in medium 1 (E u ) is parallel 
to the interface between both media. Such orientation is known as TE polarization. It is 
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often said that electric field vector E is normal to the plane of incidence. Such configuration 
is also known as s-polarization. 

Boundary conditions require that the tangential components of the total field E and the 
total field H on both sides of the interface be equal. Those conditions result in the following 
equations: 

E u +E lr = E 2t (3.45) 


—Hu cos d\ +#1,. cos 0i = — H 2t cos 0 2 


We also have the following relations involving impedances in both media: 


Eu 

H u 


= Zj, 


E 2t 

lh t =Zl 


For a non-magnetic medium (ji r = 1) the impedance can be written as Z 
being the refractive index of the medium and 



(3.46) 
f, with n 


is the impedance of a free space. Replacing magnetic fields in the second equation of 
Eq. (3.45) by the electric fields using Eq. (3.46) gives 


Eli + E\r = E 2t 

E\i a E\r E 2t 

- — COS 01 + — COS 01 = — — COS 02 

Z| Zi Z2 


Reflection coefficient is defined 


From Eqs. (3.47) by eliminating E 2t and using definition (3.48) one obtains 5 

Z 2 COS 01 — Zi COS 02 

Yte — - 

Z 2 COS 01 + Zi COS 02 


(3.47) 


(3.48) 
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Replacing impedances Z by refractive indices n, we finally have 
cos 9\ - n 2 cos 0 2 


r TE = ~ 


n\ cos + n 2 cos 02 



(3.49) 


using Snell’s law. For angles 0i such that n\ — n] cos 0i < 0 the reflection coefficient 
becomes complex. For such cases, we write it as 



Such complex number can be expressed as 


where we have defined a + jb = e J,, ‘ TE . Finally, using the definition of Fresnel phase, 
Eq. (3.44) one obtains 



tan 4> te = 


(3.50) 


•0i 


n\ cos 0i 


The above equation represents phase shift during reflection for TE polarized electromagnetic 
wave. For a more detailed discussion about phase change on reflection consult books by 
Pedrotti [4] and Liu [5]. 


Example (a) Consider normal incidence of light on an air-silica interface. Compute the 
fraction of reflected and transmitted power. Also, express the transmitted loss in decibels. 
Assume refractive index of silica to be 1.45. (b) Repeat the calculations for Si which has 
ii = 3.50. (c) Consider coupling of GaAs optical source with a refractive index of 3.6 to a 
silica fibre which has refractive index of 1.48. Assume close physical contact of the fibre 
end and the source. 

Solution 

(a) The corresponding coefficient known as reflectance [4] is 



(3.51) 


Substituting values for air and silica, one has 



so about 3% of the light is reflected. The remainder, 97%, is transmitted. 
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The transmission loss in dB is 


—10 log 10 0.97 = 0.13[cf5] 


This result shows that there is about 0.2 dB loss when light enters glass from air. 
(b) For Si, using the same procedure we obtain 


/1.00 — 3.50\ 2 
V 1.00 + 3.50/ 


This means that about 31% of light is reflected, 
(c) The Fresnel reflection at the interface is 


= 0.309 


R = 


/3.60- 1.48 V 
V3.60+ 1.48 / 


= 0.174 


Therefore about 17.4% of the created optical power is emitted back into the source. The 
power coupled into optical fibre is 


^coupled — (1 V ^emitted 

The power loss, a in decibels is 

a = — 101og 10 ^°^ = -101og 10 (l -R) 

= — 101og 10 ((1 - 0.174)) - 101og 10 (0.826) = 0.83[dB] 


3.6.2 TM polarization 


Field configuration used to analyse reflection for TM polarization (magnetic field parallel 
to the interface) is shown in Fig. 3.7. Here, electric field vector E is parallel to the plane of 
incidence. Such configuration is also known as p-polarization. 
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We will derive coefficient of reflection t-tm for TM mode. As before the Eu, E\ r , E^ are 
(complex) values of incident, reflected and transmitted electric fields in medium 1 and 2. 
Similar notation holds for magnetic vectors. Boundary conditions require that tangential 
components are continuous across interface. The relevant conditions are 

E u cos6i —Ei r cos6i = E 2 tCos9 2 (3.52) 


Using Eqs. (3.46) for impedances to eliminate magnetic fields in the previous equations 
results in the following: 

Eu cos G\ — Ei r cos9\ = E 2t cos 9 2 (3.53) 

E\i E\r^ _ E 2t 

zT + zT _ ^ 

Reflection coefficient is defined as 

Elr 


From Eqs. (3.53) 


E u 

Z\ cos 9\ — Z 2 cos 9 2 

r TM — v- n -- 7T 

Z\ cos 9\ + Z 2 cos 9 2 

It can be also expressed in terms of refractive index 


«2 cos 9\ + n\ cos 9 2 
n 2 cos 9\ — — n\ cos 9\ 

n 2 cos 9\ + ,Jn\ — ri\ cos 9\ 

TM phase is obtained in the same way as for TE polarization. Final result is 
n\ -y/ ri\ sin * 1 2 9\ - n\ 


Again, for more information consult [4] and [5], 


3.7 Polarization by reflection from dielectric surfaces 


In interpreting the formulas for vje and i-jm we distinguish between two situations: 

(1) for tii <«2 or n = > 1, one defines so-called external reflection, 

(2) for «i >n 2 or n= ^ < 1, one defines so-called internal reflection. 

Example of (1) is air-to-glass reflection and example of (2) is the glass-to-air reflection. 
A plot of coefficients of reflection for n = 1.50 is shown in Fig. 3.8. MATLAB code is 
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shown in Appendix, Listing 3A.1. The so-called Brewster’s angle for which r TM = 0 is 
also shown. For more information, see the book by Pedrotti and Pedrotti [4], 

One can obtain polarization by reflection at Brewster’s angle, see Fig. 3.9. In the figure 
we illustrate properties of the reflected and transmitted light incident at the surface at 
Brewster’s angle. 

Unpolarized light is incident at the interface at Brewster’s angle 9b. Upon reflection one 
obtains polarized TM light. At the same time, the refracted light is only partially polarized 
since rjE is nonzero. 

At Brewster’s angle 9b of incidence, coefficient of reflection r TM (also known as r\\ since 
E is parallel to the plane of incidence) is zero; i.e. r\\ = 0. The above happens when the 
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sum of the angles of incidence and refraction is equal to n /2 



2 


No such angle exists for the s polarization (TE). Thus if an unpolarized light is incident at 
Brewster’s angle, the reflected light will be linearly polarized, in fact s-polarized. 


3.7.1 Expression for Brewster's angle 


Expression for r TM is 



Vanishing of Vtm corresponds to 9\ = 9b. We have 



from which we find an expression for Brewster’s angle 



3.8 Antireflection coating 


In Chapter 9 on semiconductor optical amplifiers (SOA) we will discuss why there is a 
need to reduce (or completely eliminate) the effect of reflections. Therefore, the practical 
question is how to eliminate reflections at the interface between two dielectrics. 

As seen before, light passing through a boundary between two dielectrics is lost due to 
reflection, see Eq. (3.51). The effect depends on the difference of the values of refractive 
indices between neighbouring layers. One can observe that for equal refractive indices there 
will be no reflection but also no refraction, which is not a very interesting possibility. 

A practical method of reducing reflections is to use several layers with properly selected 
values of refractive indices. For a single layer, interference of two reflected waves can lead 
to elimination of reflection, but only at a particular wavelength. 

Let us analyse this situation in more detail. Consider reflections of two waves R\ and R2, 
see Fig. 3.10. One of the conditions for destructive interference is that the amplitudes of 
both reflected waves should be equal. From Eq. (3.51), reflection of ray R\ at interface ‘a’ 
is described by reflection coefficient 
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R, 



T 


Illustration of reflection from single dielectric film layer. Case of destructive interference is shown. 


whereas reflection of ray R 2 at interface ‘b’ is described by coefficient 



Both coefficients should be equal which gives 

/ n 0 -n f \ 2 _ / n s -n f \ 2 
\n 0 + n f ) \n s + n f ) 

Assuming that the upper medium is air, i.e. w 0 = 1, from the above one finds expression 
for the value of refractive index for a layer as 



(3.57) 


Therefore, at a particular wavelength there will be no reflection once the refractive index 
of coating layer is given by (3.57). If there is a need to design structures producing no 
reflection over some frequency band, one must design multilayer structure consisting of 
layers with different refractive indices. In order to design such structures a more realistic 
description is necessary. Typical calculations are based on transfer matrix approach. 


3.8.1 Transfer matrix approach 


We now describe the transfer matrix approach, also known as the characteristic matrix 
approach [6], 

Consider a single dielectric layer with refractive index n deposited on a substrate with 
refractive index n s . Above it is a medium with refractive index no, see Fig. 3.11. 

We consider an s-polarized wave where electric field vector E is perpendicular to the 
plane of incidence. Boundary conditions dictate that tangential components of E and H 
fields are continuous across the interfaces, i.e. at positions z = z\ and z = z 2 . 

We introduce the convention that E+(z) denotes light propagating in the positive z 
direction whereas E _ (z) denotes light propagating in the negative z direction. Close to 
interface described by z = z\, due to continuity of tangential component one writes 


E(zj) = E+(z+) + E“(z+) = E+(zp + E“(zp 


(3.58) 
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Illustration of electromagnetic fields at the boundaries used in the derivation of transfer matrix. 


where z+ signifies value of coordinate z slightly larger than z \, etc. Similarly, close to z = z 2 
E(z 2 ) = E+(z+) + E“(z+) = E + (z“) + E-(z“) (3.59) 

For future use we introduce d = z 2 — zi. When field E(z) propagates in the film over 
distance d, it acquires phase S derived previously (see Eq. 2.3 1) 


8 = 

—nd cos a 

Xq 

(3.60) 

Electric fields that are close to interfaces 

are therefore related as 


E+(z-] 

l = E+(z+)e-“ 

(3.61) 

and 



E-(z- 

) = E-(z+)e iS 

(3.62) 

To determine the behaviour of a magnetic field, we use the fact that fields E and H are 


related as 

H=|-TxE 

A) 

where k is the unit vector along the wave vector k and Zo = is the impedance of the 
free space. The condition of continuity of tangential component of H vector gives: 
at z = zi 

Z 0 H(zi) = [E+(zp -E-(z-)]n 0 cosa 0 = [E+(z+) - E"(z+)] iicosa (3.63) 
at z = z 2 

Z 0 H(z 2 ) = [E + (zj ) — E - (zj )] ncosa = E + (zJ)« s cosa 
Substitute Eqs. (3.61) and (3.62) involving phase 8 into (3.59) and (3.64) to obtain 

E(z 2 ) = E+(z+)<T ia +E-(z+)e w 

H( Z2 ) = «cosa [E+(z + )e -« _ E - (z + )e «] 

Zq 


(3.64) 
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Solutions of the above equations are 

2E ~(z+)e iS = E(z 2 ) — zr— —H(z 2 ) 
n cos a 

2E+(z+)e- iS = E(z 2 ) + zr— —H(z 2 ) 
ncosa 

With the help of the above solutions and using Eqs. (3.58) and (3.63) we can evaluate fields 
at zi and z 2 

Zn 

E(zi) = E(z 2 )cos<5 + -H(z 2 )z sin 5 

ncosa 

and 


Z 0 H(zi) = E(z 2 )z'sin(5«cosa + Z 0 H(z 2 ) cosS 


The above equations can be written in matrix form 


pE(zO l = r cos5 ^1[E(Z 2 ) 1 -t* r E (z 2 ) 1 

Lz 0 H(z| )J Lwsinacosa cossj |_Z 0 H(z 2 )J |z<>H(z 2 )J 

For normal incidence, i.e. a = 0, the transfer matrix is 




cos 8 

znsiniS cos S I 


Special cases of importance are: 

1. The quarter-wave layer when hd = Xq /4; 8 = 7t /2 and matrix AT is 



2. The half-wave layer when nd = A.q/2; 8 = jt and matrix is 


^ X/2 


(3.65) 


(3.66) 


(3.67) 


(3.68) 


3.9 Bragg mirrors 


The Bragg mirror, also known as the Bragg reflector, consists of identical layers of dielectrics 
with high and low values of refractive indices as shown in Fig. 3.12. The main interest in 
fabricating such structures is that they have extremely high reflectivities at optical and 
infrared frequencies. They are important elements of VCSEFs where high reflectivity and 
bandwidth are required. A typical structure forming Bragg mirror consists of N layers of 
dielectrics with refractive indices hi (low refractive index) and hn (high refractive index). 
The ratio of those values, the so-called contrast ratio, plays an important role. 

The structure is known as a quarter-wave dielectric stack, which means that the optical 
thicknesses are quarter-wavelength long; that is =hi-ai = a 0 /4 at some wavelength 
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© E y 


Schematic of a seven-layer dielectric (Bragg) mirror; shown are N = 3 periods. 



(n-l) th cell nth cell 

Notation used in the analysis of Bragg mirrors. 


k 0 . The structure consists of an odd number of layers with the high index layer being the 
first and the last layers [7]. 

Here, we outline a formalism which is used in the design of Bragg mirrors. We also 
illustrate its applications with an example. We restrict our discussion to s-polarization (TE 
modes). Electric field vector is directed along y-axis E — E(x)-Oy, see Fig. 3.12 where we 
showed the orientation of electric fields and analysed structure. 

We develop transfer matrix notation which is based on the transfer of fields from cell n 
to cell n — 1. The cell is formed by two neighbouring layers which create a basic periodic 
structure which is then repeated some number of times. The electric field will now be 
expressed in the three layers of interest, see Fig. 3.13 for notation. 

In cell n one expresses the electric field as 


E y (x) = 


\A„ e *"(* -*") + B n 

i Q e -jk L (x-x p ) _|_ £) e jk L (x-x p ), 


(3.69) 


where k 2 H = (""") and k 2 = ( n '^) . One has also the following relations for coordinates 
of layers: x n = n ■ A, x„_i = (n — 1) • A, x p = n ■ A — an- Here A = cifj + a/, is the period 
of the structure. 
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The magnetic field vector is determined from the Maxwell equation, Eq. (3.1). For our 
geometry, one has the relation between electric and magnetic fields 
dEy(x) 
dx 

From continuity of electric field atx p and x„_i one obtains 

x = x p C„+ D n = A„ e ik "- a " + B n e~^ H aH 
x = x„ i A„_ i + = C„ e’ k ' ' a '- + D n e -* L aL 

Similarly, continuity of magnetic field produces equations 

x = x p -k L C n + k L D n = -k H A n e jk "' a " + k„B n e~i k » a » 
x = x n — i -k H A n _i + k H B n _ x = -k L C„ el*** + k L D„ e~^- 
The previous four equations can be written in matrix forms as follows: 

n iirc„ 

[k L -k L \ \D n 


(3.70) 


(3.71) 


(3.72) 


'I 1 lr^ilje^ 

k„ -k„\ [B n _ x \ [k L e^ a 


e Ma H I 

r.i.i 

-k H e jkHaH \ 

UJ 

e -jk L a L 1 

-he-fo* \ 

ft] 


The above equations transfer fields from cell n to cell n — 1. In the next step amplitudes 


ft] 


will be eliminated and the transfer process will be described in terms of amplitudes 
By doing so, one obtains the practical equation 


IB, 


(3.73) 


Matrix V" can be expressed as 

r = ¥ H .r L .^ L .r H ^ 

Matrix i describes propagation in a uniform medium having refractive index ni and 
thickness ai and has the form 


M*r a] 


(3.75) 


Similarly, matrix ^h describes propagation in a uniform medium having refractive index 
hh and thickness an- Matrices X h and / describe behaviour at interfaces and are 




k H + k L k H - k L ' 
2 k H 2 k H 

kH ~ k L k H + k L 
2 k H 2 k H . 
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Notation used in the definition of reflection from Bragg mirror. 


and 




-k-L + kn kL — kn~ 
2 k L 2 k L 

h ~ k H k L + k H 


(3.77) 


L 2 k L 2 k L J 

By referring to the Fig. 3.13 propagation (to the left) from an arbitrary point in the struc¬ 
ture, say point x n involves matrix T h (propagation over a cmiform region with «//, then 
propagation over an interface from region having n H to n L (described by matrix L ), prop¬ 
agation over uniform region with n L (described by matrix *7^ £) and finally, propagation 
over an interface from n L to nn (described by matrix H ). 

Matrix V" can be expressed as 


r =[“ 3 <3 - 7s) 

where the elements are 

a = e ,a " k " [cos a L k L + sin a/A J (3.79) 

b= r' "“(Irl)™”* <3 ' 80) 

c= r~*'(|-|) sin “ A <3 - 81) 


d = e- >a " k " [cos a L k L -L^jL + ^ sinazfc] (3.82) 

Matrix r is unimodular matrix, i.e. real square matrix with determinant ad — be = ±1 

[ 8 ], 

Referring to Fig. 3.14, the amplitude reflection coefficient is defined as 


(3.83) 


Choosing a column vector of layer one as the zeroth unit cell and remembering that all cells 
are identical, one obtains for N cells 
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Wavelenght (nm) 


Spectrum of reflectivity of a Bragg mirror. 


Since matrix ^ is a unimodular matrix, one has the following relation [8] 


pr bl N _ \a ■ Un -i 
L c |_c • Un- i 


(x) - U N _2 (x) b-U N -i(x) 

C x ) d ■ U N -i (x) - Um-i(x) 


where x = \{a + d) and U^{x) are Chebyshev polynomials of the second kind 

u _ sin [(to + 1) cos" 1 x] _ sin [(to + 1)0] 

Vl - x 2 sin0 

where x = cos 9. Using the above relations, the amplitude reflection coefficient is evaluated 


W [c-C/v-1 


(x) - Uv—2(x) b • £/jv_i(x) 
r-i W d • (x) — U N _ 2 (x) 


]E 


for cell A. From above relation, one finds 


c • U N -1 (x) 


A o a ■ U N -i(x) - U N - 2 (x) 

The definition of reflectance is [9] R = \r\ 2 . After a little algebra one finds 

|c| 2 


R = - 


(sinMKj\) 


Spectrum of reflectivity as given by Eq. (3.84) for N = 10 periods of Bragg reflector is 
shown in Fig. 3.15. The MATLAB code is shown in Appendix, Listing 3A.2. The following 
parameters were adopted: n# = 2.25, hl = 1.45, a# = 167 nm and a l = 259 nm. 
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The lateral displacement of a light beam on total reflection at the interface between two dielectric media 
(Goos-Hanchen shift). 


3.10 Goos-Hanchen shift 


Detailed examination of the internal reflection of a light beam on a planar dielectric interface 
shows that the reflected beam is shifted laterally from the trajectory predicted by the simple 
ray analysis, as shown in Fig. 3.16. The lateral displacement arises because wave energy 
actually penetrates beyond the interface into the lower index media before turning around. 
Total lateral shift is d = 2 z s whereas x s is penetration depth. 

To determine lateral shift d, we follow Kogelnik [10]. Consider incident beam A as 
consisting of two plane waves incident at two slightly different angles with wave vec¬ 
tors P ± Aft. The complex amplitude A (z) of the incident wave packet at the interface 
x = 0 is 


A{z) = e~ M+Ap)z + g-yW-Aflz 
= e~ iPz (e~ jApz + e> Apz ) 
= 2 cos (A fiz)e~ JPz 


Reflected beam B is 


(3.85) 

(3.86) 


where R — exp (2 j‘0). Phase shift 0 that occurs upon reflection is different for both types 
of polarization and has been determined before, see Eqs. (3.50) and (3.56). Phase shift 0 is 
a function of angle 9 and propagation constant fi. Assume a small value of A/i and expand 

d<b 

000 + A P)=<P 00) + -y’yA/f = 0 00) + A0 
dp 

Using the above expansion and (3.85) allows us to write reflected beam B as 

B _ R.A = e 2 MP± A P) j e -y(/S+A/S)z + e -j(f-Afi)zj 
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where z s = $jt. The net effect of the phase shift is to displace the beam along the z axis by 
a distance 2z s . Distance z s is different for both types of modes. For TE modes [11] 


koz s = 


tan 6 

p-4 m 


whereas for TM modes 


h z s = 


tan 9 1 

(£2 _ n 2)V2 p2/ n 2 f + p2/ n 2 _ X 


3.11 Poynting theorem 


Electromagnetic waves carry with them electromagnetic power. We will now deliver a rela¬ 
tion between the rate of such energy transfer and the electric and magnetic field intensities. 
We start with Maxwell’s equations (3.1) and (3.2) 

3B 

V x E =- (3.88) 

dt v 1 

3D 

V x H = — + J (3.89) 

3 1 

Use the following mathematical identity: 

V • (E x H) = H • (V x E) - E(V x H) (3.90) 

Substituting Maxwell’s equations into (3.90) and using constitutive relations, one obtains 

3H 3E 

V • (E x H) - -H a -E J - E a — 

3 1 dt 

13H-H , 13E-E 

= — [X, - (xE 2 — £ - 

2 dt 2 dt 

= ~ Q^ 2 + ^H 2 ) ~ °E 2 (3.91) 

Integrate (3.91) over volume V and apply Gauss’s theorem, and then one finds 

J V • (E x H )dv = -j t j \ sE2 + dY - / a£2dY ( 3 - 92 ) 

The Poynting vector P is defined as 
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where 

w e = \eE 1 2 is electric energy density 
w m = \liH 2 is magnetic energy density 
p„ = aE 2 is Ohmic power density 

From the above equation, one can interpret vector P as representing the power flow per 
unit area. 


3.12 Problems 


1. Using the expression derived for a transfer matrix, consider an antireflection structure 
consisting of two-layer quarter-wave-thickness films made of CeF^ (low index layer 
with refractive index n\ — 1.65) and high-index layer of ZrO-i (refractive index 
n = 2.1) deposited on glass substrate with refractive index n s = 1.52. Assume that 
on top there is air with n 0 = 1. Plot the spectral reflectance of such structure. 

2. Show that a linearly polarized plane wave can be decomposed into a right-hand and 
a left-hand circularly polarized waves of equal amplitudes. 

3. Derive an expression for a transfer matrix assuming p-polarization. In this case the 
vector of electric field E is parallel to the plane of incidence. 

4. Determine the reflectance for normal incidence and plot it versus path differ- 
ence/wavelength for various values of refractive index. 

5. Write a MATLAB program to illustrate propagation of EM wave elliptically po¬ 
larized. The wave will consist of two perpendicular waves of unequal amplitudes. 
Visualize propagation in 3D. Analyse the state of polarization (SOP) by changing 
phase difference 0 between individual waves. 

6. Analyse a Bragg mirror for TM polarization. Derive matrix elements of matrix V" 
for this polarization [9], 

7. Assume that for some materials refractive index at a particular wavelength is negative. 
Discuss the consequences of such an assumption. Consider modification of Snell’s 
law. 

8. What percentage of the incoming irradiance is reflected at an air-glass ( n g i ass =1.5) 
interface for a beam of natural light incident at 70° ? 

9. Write an expression for a right circularly polarized wave propagation in the positive 
z-direction such that its E-field points in the negative x-direction at z = 0 and t = 0. 

10. Verify that linear light is a special case of elliptical light. 

3.13 Project 


1. Write a MATLAB program to determine reflectance for a multilayered structure con¬ 
sisting of different dielectric materials of various thicknesses. Analyse the special case 

of d = X/2. 
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Table 3.2 List of MATLAB functions for Chapter . 

Listing 

Function name 

Description 

3A.1 

reflections-TE _TM .m 

Plots reflections based on Fresnel equations for TE and 
TM modes 

3A.2 

Braggjan.m 

Plots reflectivity spectrum of Bragg mirror for TE mode 
using an analytical method 


2. Develop MATLAB functions to analyse propagation of TM modes for arbitrary mul¬ 
tilayered waveguiding structure. Use a similar approach as described above for TE 
modes. 


Appendix A: MATLAB listings 


In Table 3.2 we provide a list of MATLAB files created for Chapter 3 and a short description 
of each function. 


Listing 3A.1 Program reflections_TE_TM.m. Program plots reflection coefficients for TE 
and TM modes. 


'/. reflections_TE_TM.m 

°/ 0 Plot of reflections based on Fresnel equations 

7, TE and TM reflections 

°/o cases of external reflection 


n = 1.50; 

N_max =11; 

t = linspace(0,1,N_max); 
theta = (pi/2)*t; 

7„ Plot reflection coefficients 
num_TE = cos(theta) - sqrt(n"2-(i 
den_TE = cos(theta) + sqrt(n"2-(i 
r_TE = num_TE./den_TE; 
num_TM = n~2*cos(theta) - sqrt(n~2-( 
den_TM = n“2*cos(theta) + sqrt(n~2-( 


% relative refractive index 
% number of points for plot 
7, creation of theta arguments 
% angles in radians 

(theta))."2 ); 

(theta)).“2 ); 

7. Plot for TE mode 
(theta))."2 ); 

(theta))."2 ); 


un_TM./den_TM; 


% Plot for TM mode 


7. 


angle_degrees = (theta./pi)*180; 

x_line = [0 max(angle_degrees)]; 7o needed to draw horizontal line 
y_line = [0 0]; 7. passing through zero 

plot(angle_degrees,r_TE,angle_degrees,r_TM, x_line, y_line, 

’LineWidth’,1.5) 
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xlabel(’Angle of incidence’,’FontSize 1 ,14) 

set(gca,’FontSize’,14); °/„ size of tick marks on both axes 

text(60, -0.6, ’r_{TE}’,’Fontsize’,14) 

text(60, -0.2, ’r_{TM}’,’Fontsize’,14) 

text(55, 0.1, ’\theta_{Brewster}’,’Fontsize’,14) 

pause 

close all 


Listing 3A.2 Program Bragg _an.m. Determines spectrum of reflectivity of a Bragg mirror 
for TE mode using an analytical method. 


I File name: bragg_an.m 

7. Determines reflectivity spectrum of Bragg mirror for TE mode 
7, using analytical method 
clear all 


N = 10; 
n_L = 1.45; 
n_H = 2.25; 
a_L = 259; 
a_H = 167; 

Lambda = a_L + a_H; 
lambda = 1000:10:2200; 
k_L = 2*pi*n_L./lambda; 
k_H = 2*pi*n_H./lambda; 

7. 


I number of periods 
l refractive index 
l refractive index 
l thickness (nm) 
l thickness (nm) 
l period of the structure 


a=exp(li*a_H*k_H),*(cos(k_L*a_L)+(li/2)*(k_H./k_L+k_L./k_H).*sin(k_L*a_L)); 
d=exp(-li*a_H*k_H).*(cos(k_L*a_L)-(li/2)*(k_H./k_L+k_L./k_H).*sin(k_L*a_L)); 
b = exp(-li*a_H*k_H),*((li/2)*(k_L./k_H - k_H./k_L).*sin(k_L*a_L)); 
c = exp(li*a_H*k_H).*((li/2)*(k_H./k_L - k_L./k_H).*sin(k_L*a_L)); 

7. 


K = (1/Lambda)*acos((a+d)/2); 

tt = (sin(K*Lambda)./sin(N*K*Lambda)). ~2; 

denom = abs(c).~2 + tt; 

R = abs(c)."2./denom; 

plot(lambda,R,’LineWidth’,1.5) 

axis([1000 2200 0 1.2]) 

xlabel(’Wavelenght (nm) ’, ’FontSize’,14) 

ylabel(’Reflectivity’,’FontSize’,14) 

set(gca,’FontSize’,14); % size of tick marks on both axes 

pause 

close all 
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Slab waveguides are important ingredients of both passive and active devices. Therefore, 
discussion and understanding of their properties are critical. The following topics will be 
discussed in this chapter: 

• ray optics of the slab waveguide 

• electromagnetic description 

• three-layer problem for symmetric guiding structure 

• TE modes in multilayer waveguide. 


4.1 Ray optics of the slab waveguide 


In this section we discuss applications of ray optics concepts to analyse the propagation 
of light in slab waveguides. We start with summarizing the concept of numerical aperture 
(NA). 


4.1.1 Numerical aperture 


Consider light entering a waveguide from air, having the refractive index n 0 to waveguiding 
structure with refractive indices n\ and «2 (see Fig. 4.1). We represent light within ray optics. 
We want to understand the conditions under which light will propagate in the middle layer 
with the value of refractive index n\. 

Ray 1 (dashed line) which enters waveguide at large incident angle 9\ propagates through 
the middle layer and penetrates the upper layer with refractive index m- Since this ray does 
not propagate in the middle layer, it is effectively lost. 

As we gradually decrease incident angle 9, we reach the situation where a ray slides across 
the interface (case for ray 3). In such a case incident angle 9 is known as the acceptance 
angle 9 a . The internal angle of incidence at point D is then <pc and it is determined from the 
relation 


sin0 c 


(4.1) 


For the incident ray (like ray 2) which enters the interface at an angle smaller than the 
acceptance angle 9 < 9 a , it will propagate in the middle layer. Let us analyse the existing 
situation in more detail. 
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From triangle ABC, one finds 02 = tc /2 — 0 (angle 62 not shown in Fig. 4.1). From 
Snell’s law for ray 3, we obtain 

no sin = n\ sin 02 = «i sin {tx /2 — <j>) = ri\ cos <j) = iq (l — sin 2 </>) 1/2 

From the above, we have (using Eq. 4.1) for ray 2 (propagating at the acceptance angle, i.e. 
for 0i = 6 a ) 

no sin0 fl = ifr (l - sin 2 0 C ) 1/2 = (n 2 - n ^) 1/2 
Numerical aperture (NA) is defined as 

NA = no sin0 fl = («, - n 2 2 )' 

We introduce relative refractive index difference A as 




A ~ 2 n\ 

The approximate expression for A is obtained by observing that n\ &n 2 , which allows us 
to write «i + «2 ^ 2«,. The approximate formula for A is therefore 

A _ (n 1 +n 2 ) (n 1 -n 2 ) ^ 2«i (771 -n 2 ) _ «i - n 2 
2n\ 2 n\ «i 

In terms of A, NA can be approximately expressed as 
NA = n x (2A) 1/2 


4.1.2 Guided modes 


We will first discuss the propagation of a light ray in a waveguide formed by the film of 
dielectric surrounded from below by substrate and by cover layer from above, see Fig. 4.2. In 
such a structure light travels in a zig-zag fashion through the film. Light is monochromatic 
and coherent and is characterized by w angular frequency and A. free-space wavelength. 
The following relation holds k = y ; , where k is known as the wavenumber. Electric field 
of the propagating wave is 

j? ^ g—jkrif(±x cos <f>+z sin </)) 
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n c cover 



n s substrate 


Light propagation in a slab waveguide in a ray approximation. 

Let p be a propagation constant of the guided mode of the slab. One has 

P = — = krif sin 0 (4.3) 

where v p is the phase velocity. Please note that not all angles are allowed (sometimes none). 
Referring to Fig. 4.2, rays at 0 = 90° travel horizontally along the waveguide. Generally, 
rays which undergo total internal reflection can travel within the waveguide. That condition 
can be fulfilled for rays travelling at angles fulfilling the condition (all angles are determined 
with respect to normal) 

0 C < 0 < 90° 


or 


nf sin0 c < ri f sin0 <rif sin90° = «/ 


where we have also multiplied the inequality by n /. Using Eqs. (4.1) and (4.3), one obtains 


n s < N < nf (4.4) 

Here we introduced ‘effective guide index’ N defined as 

N = ^ = «/sin0 (4.5) 


4.1.3 Transverse resonance condition 


The condition is expressed as ‘the sum of all phase shifts during propagation must be a 
multiple of 27r’. The following are the contributions to the phase change: 

knfh cos 0 phase change during first transverse passage through the film 
—20 c phase shift on total reflection from cover 

—20,, phase shift on total reflection from substrate 

Combining the above contributions, one obtains transverse resonance condition as 

2knfhcos(j) — 20, — 20 c = 2nv, v= 0, ± 1, ± 2,... (4.6) 

which is the dispersion equation of the guide. Variable v identifies mode number. 
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Expression for the Fresnel phase for TE polarization was derived in Chapter 3, Eq. (3.50). 
Modifying notation slightly (angle 0\ is replaced by 0), the relation reads 




sin z (p ~ n 2 

* «1COS0 

Adopting the above expression for our present geometry, one writes for phases on reflec¬ 
tions: 

1. for reflection on the film-cover interface 


■J n f sin 2 <t>-n 2 c Jnj sin 2 (p-n' c 

tan<p c — -— -— - 


2. for reflection on the film-substrate interface 


M / cos< ^ ^-n’sin^ 

:interface 

Jn 2 sin 2 (p - n] Jnj sin 2 <j)-n 2 

tan0 s =-----= . = 

"f™* ^/n 2 — «y-sin 2 d> 

4.1.4 Transverse condition: normalized form 


The above transverse condition will now be cast in a normalized form which is more suitable 
for numerical work. For that, let us introduce the following variables: variable V 


V = k ■ hjnl - n 2 


normalized guide index, b 


and asymmetry parameter for TE modes, a 


For TE modes, use transverse resonance condition and Fresnel phases, and obtain 


KV1— i = v • 7r + tan 1 


Example Derive transverse resonance condition in a normalized form, Eq. (4.12). 
Solution 

Using definitions (4.10) and (4. 1 1), one can prove the following relations: 

b _ N 2 — n 2 
b ~ nj-N 2 
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and 


and also 


b +a _N 2 -n 2 c 
T ~b ~ n 2 -N 2 



Using the above relations, the expressions for phases, Eqs. (4.7) and (4.8) can be written as 


and 


<p s = tan 1 


N 


N 2 — n 2 
nl-N 2 " 


lb+ a 

Y 1-4 


r~b~ 

V r ~b 


nf cos <p = yjn 2 — n 2 sin 2 cp = yjn 2 — N 2 
Substituting the above formulas into (4.6), we obtain 


A MATLAB program to analyse the transverse resonance condition (4.12) which plots 
normalized guide index b versus variable V, is provided in the Appendix, Listing 4A. 1. It 
allows us to input different values of the asymmetry parameter a. 

A plot of normalized guide index b versus variable V for three values of parameter a 
(a = 0, 8, 50) is shown in Fig. 4.3. 

We finish this section with a typical simplified expression for effective index N. From a 
definition of b one obtains 

b{n 2 f - n 2 ) + n 2 = N 2 

Perform the algebraic steps 
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V 

Normalized b — V diagram of a planar waveguide for various degrees of asymmetry. 

Finally, one finds 

N*n s + l -b(n f -n s ) (4.13) 


4.2 Fundamentals of EM theory of dielectric waveguides 


4.2.1 General discussion 


Assuming time-harmonic fields and using constitutive relations, the source-free Maxwell 
equations are 

V x E = -jean H (4.14) 

V x H = jme E (4.15) 

Boundary conditions were derived in Chapter 3. Separate fields into transversal and longi¬ 
tudinal components as 

E = E, + E z , H = H, + H z (4.16) 


where 

E, = [E x , E y , 0] 

is the transverse part and 

E z = [0, 0, E z \ 
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is the longitudinal part. Also 

V = V t + a z |-, a z = [0, 0, 1] (4.17) 

dz 

where a z is a unit vector along z-direction. Substituting (4.16) and (4.17) into Eqs. (4.14) 
and (4.15), one obtains 

V, x E, = -jcof .iH (4.18) 

V,xH,= jcosEz (4.19) 

V,xE : + a z x^ = -JtofiHt (4.20) 

az 

V, x H z + a z x ^ = jeoeEt (4.21) 

az 

Modes in a waveguide are characterized by a dielectric constant 

e(x,y) = e 0 n 2 (x,y) (4.22) 

where n (x, y ) is the refractive index profile in the transversal plane. Write the fields as 

E (x, y, z) = E v (x, y) e~^ z (4.23) 

H (x,y,z) = H v (x,y)e~J^ 

where we have introduced index v which label modes and p v is the propagation constant 
of the mode v. After substitution of Eqs. (4.23) into (4. 1 8)—(4.21) one obtains 


V t x E tv (x,y) = —jconll~ v (x, y) (4.24) 

V, x H n (x, y) = jweE zv (x, y) (4.25) 

V t x E zv (x, y) — jp v a z x E (u (x,y) = —jajyiW n: (x, y) (4.26) 

V t x H zv (x,y) - jp v a z x H ; , (x,y) = jcoeE tv (x,y) (4.27) 


From the analysis of the above equations, the existence of several types of modes can be 
recognized. Those will be described in more details later on. Generally, the main modes are: 

• guided modes (bound states) - discrete spectrum of p v 

• radiation modes - belong to continuum 

• evanescent modes-/J v = — jay, they decay as exp (—a v z). 

4.2.2 Explicit form of general equations 


Using the following general formulas 


V,xE t = 


d_ 

dx 


E x 


a v 

_a_ 

dy 

Ey 


a z 
0 
0 


0 E z 
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a z x E, = 0 0 1 

E x E y 0 

where a*, a v , a z are unit vectors along the corresponding directions, we write general 
Eqs. (4.24)-(4.27) in expanded form (dropping mode index) 

T dE v dE x I 

°. °, I m-jeon [0, o, H z ] (4.28) 

r dH v dH x i 

I °, °, - -J 1 I* Jcoe [0, 0, E z \ (4.29) 

-jp[-Ey,E x , 0] = -jcon[H x ,H y , 0] (4.30) 

=jeoe[E x ,E y ,0] (4.31) 

In the following sections the previous general equations will be used to analyse specific 
situations. 


4.3 Wave equation for a planar wide waveguide 


For the wide waveguide where its dimension in they-direction is much larger than thickness, 
the field configuration along that direction remains approximately constant, see Fig. 4.4. 
Therefore, we can consider only confinement in the x-direction and set 



Also, the refractive index assumes only x-dependence 
n = n (x) 


Such a waveguide supports two modes: 

• transverse electric TE, where E y ^ 0 and E x = E z = 0 

• transverse magnetic TM, where H y ^ 0 and H x = H z = 0. 
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From general formulas given by Eqs. (4.28)-(4.31), one obtains a set of three equations 
describing TE modes and also three equations describing TM modes. Those are written 
below for both types of modes separately. 

TE modes 

To describe TE modes we use only equations which involve E y and its derivatives. One 
observes that when E x = E z = 0, from Eq. (4.30) H y = 0. Equations describing those 
modes are 

pE y = —6jfiH x (4.32) 

dl-L 

+jpH x = -j(oeE y (4.33) 

( 4 - 34 ) 

dx 

The last step will be to eliminate H x and H z . We will differentiate the third equation, then 
substitute for ^ using the second equation and finally eliminate H x using first equation. 
We obtain a wave equation for TE modes 

32 p 

d -^ = (P 2 -nW)E y (4.35) 

where k = “ = co^/sqJZo- 

TM modes 

We proceed in a similar way as for the TE modes. From Eq. (4.31) when H x = H z = 0, 
one obtains that E y = 0. We keep only equations which involve H y and its derivatives. 
Equations which describe TM modes are 

PH y = coeE x 
dH y ■ z, 

17 " imE - 

dE 

— + jPE x = jconHy 

dx 

We will use the fact that e = e 0 n 2 and eliminate E x and E z . Here one must be more 
careful because n 2 can be x dependent. From the first equation we determine E x and from 
the second equation we determine E z . Substituting the results into the third equation gives 
the wave equation for TM mode 

<436) 


4.4 Three-layer symmetrical guiding structure (TE modes) 


We will analyse the three-layer symmetrical structure for TE modes. The structure is as 
shown in Fig. 4.4. The details are illustrated in Fig. 4.5. A film layer of thickness 2 a and 
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refractive index «/ is surrounded by a substrate and cladding layers have refractive indices 
equal to n 0 . 

We introduce the following notation: 


k 2 = n 2 k 2 — f} 2 

(4.37) 

and 


y 2 = P 2 — "Ik 2 

(4.38) 

Here, y is determined by the value of refractive index around the film. In general, y assumes 
values which are dictated by the values of refractive indices in the cladding and substrate. 
Therefore, one deals with more than one value of y. 

The wave equation is 

We discuss two separate solutions: odd and even modes. 


Odd modes: 

We assume the following guiding solution: 


E y (x) = A c e~ y(x ~ a) , a<x (cladding) 

Ey(x) = B sin KfX, —a < x < a (film) 

E y (x) = A s e y(x+a) , x<-a (substrate) 

(4.39) 

At the interfaces, i.e. for x = ±a, the field E y and its derivative ^ 
The continuity at x = a gives the following equations: 

must be continuous. 

A c = BsmiCfa 
—yA c = KfB cos KfCi 


From above equations, one obtains 


— y=Kf COt Kfa 

(4.40) 


The continuity at x = —a gives an identical equation. By introducing variable 


y = K f a 

Eq. (4.40) can be written as 


—ycoty = ya 
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Graphical solution of the transcendental equation for even modes. 


From the definition for y we obtain 

—ya = a,/ n 2 k 2 — fi 1 = -J a 2 n 2 k 2 — a 2 /? 2 (4.41) 

Similarly, from the definition for Kj we obtain 


Determining a 2 / 3 2 from the above and substituting into Eq. (4.41), one finds 
—y coty = V R 2 —y 2 


where we have defined 

R 2 = a 2 k 2 (n} - n 2 ) 
Transcendental Eq. (4.42) must be solved numerically. 

Even modes: 

For even modes we assume the following guiding solution: 


E y (x) = A c e-r (x ~ a) , a < x (cladding) 

E v (x) = B cos KfX, —a < x < a (film) 

Ey(x) = A s e Y<x+a> , x <—a (substrate) 


(4.42) 

(4.43) 


(4.44) 


The remaining steps are done in the exactly same way as for odd modes. The resulting 
transcendental equation for propagation constant is 

ytany = y/R 2 - y 2 (4.45) 


The function R is defined by Eq. (4.43). Eq. (4.45) is represented in Fig. 4.6. Intersections 
between line h which represents the right hand side of Eq. (4.45) and lines g'.y which 
represent the left hand side of Eq. (4.45), determine propagation constants. Those must be 
found numerically. For numerical searches, the following functions are introduced: 
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The above functions are used in numerical search, since e.g. f eve n(y \) = 0 corresponds 
to the solution y\ from which propagation constant P\ is determined. The algorithm is 
described in the next section. 


4.4.1 The algorithm 


The following main steps constitute the numerical algorithm. 

1. Search intervals for even and odd functions are formed as follows: 
even functions 

y t = nn,y f = min (nit + * - 10" 3 , R) 

odd functions 

y t = — + nn,yf = min((n + 1 )tt — 10 -3 , R) 

where y, and yj are initial and final values of the appropriate search interval. The small 
constant (10 —3 ) has been introduced to avoid working close to singularity. 

2. Search is done for zeros in each interval. 

3. Even and odd solutions are obtained independently in separate searches. 

4. The solutions are found using MATLAB function fzeroQ as 

ytemp = fzero(func , [y;,y/]) 

where the first argument contains search function and the second one contains interval for 
search. 


4.5 Modes of the arbitrary three-layer asymmetric planar 
waveguide in ID 


Consider the modified version of the structure shown in Fig. 4.5. The structure has different 
values of refractive indices of substrate and cladding and it is known as asymmetric planar 
waveguide. Here, n c signifies refractive index of cladding,«/ of film, and« v that of substrate. 
For an asymmetric slab, n c f n s . Define the following quantities: 

K 2 = nlk*-P = -y? 

k] = njk 2 - p 2 (4.48) 

xf = — p 2 = —y 2 

where y, describes transverse decay and /c, contains propagation constants, i takes the 
values c, /, s as appropriate. In the next subsection we discuss TE modes for this three- 
layer structure. 
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4.5.1 TE modes 


For guided TE modes the following solutions exist: 

E y (x) = A c e~ yc ^ a) a < x 

E y (x) = AcosKfX+ BsiniCfX —a < x 

E y (x) = A s e y * (x+a) x < -a 

Derivatives are determined as follows: 

dEy(x) -y c (x-a) 

-= — y c A c e ' cK ’ a < x 


dx 

dE y (x) 


(cover) 

(film) 

(substrate) 


(cover) 


= — KfA cosAyx + Kf BsiniCfX —a < x < a (film) 

- = yA,e K<x+a> x < —a (substra 


(4.50) 


Boundary conditions dictate that 

E v and —are continuous for x = a and for x = —a (4.i 

dx 

When applying boundary conditions for E y and dF ^ x) , we get the following equations: 


forx=— a A cos Kfa — B sin Kfa = A s 

k fA sin k fa + k fB cos k fa = y s A s 
for x = a A c = AcosKfd + BsinKfd 

—y c A c = —kjA sin Kja + KjB cos Kfa 
The above equations can be written in a matrix form 


cos Kfa 

—sin Kfa 

-i 

0 " 

~A~ 

Kf sin Kfa 

Kf QOS Kfa 

~Ys 

0 

B 

cos Kfa 

sin Kfa 

0 

-1 

As 

_— Kf sin Kfa 

Kf QOS Kfa 

0 

Yc. 

_A C _ 


For the above homogeneous system to have nontrivial solution, the main determinant should 
vanish. 


cos Kfa —sinAT/a —1 0 

KfsinKfa Kf cos Kfa —y s 0 

cos Kfa sinAT/a 0 —1 

— KfsinKfa Kf cos Kfa 0 y c 


(4.53) 


The above determinant is evaluated as follows. Let us expand it over last column and we 
get 



cos Kf a 

—sin Kfa 

-i 


cos Kfa 

—sinAya —1 

Yc 

Kf sin Kfa 

Kf COS Kfa 

-Ys 

+ 

Kf sinAya 

Kf cos Kfa —y c 


cos Kf a 

sin Kfa 

0 


—Kf sin Kja 

Kf cos Kfa 0 


Evaluating both determinants, we obtain 
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which can be expressed 



(4.54) 


It is the general equation for a three-layer asymmetric waveguide. For a symmetric 
waveguide 


Ys = Yc = Y 


and one can write Eq. (4.54) as 



(4.55) 


which describes even and odd modes discussed in the previous section (symmetric modes). 

Note: The above modes are not normalized for power. Power P carried by a mode per 
unit guide width is determined as follows: 





= Ef-Hfheff 


(4.56) 


where h e ff = 2a + — + — is the effective thickness of the waveguide. 


4.5.2 Field profiles for TE modes 


Analysis of the general determinant given by Eq. (4.54) for the asymmetric waveguide 
is complicated. If one wants to obtain formulas suitable for numerical evaluations and 
also suitable for obtaining field profiles, a better approach is to eliminate all constants 
appearing in Eq. (4.49). Thus, one will have a field profile expressed in terms of one 
constant only, say A s . One then needs to determine derivatives which should be continuous 
across interfaces. From continuity of derivatives, the relevant transcendental equation used 
to obtain propagation constants is obtained. 

We consider asymmetric structure where the substrate-film discontinuity is at x = 0 and 
film-cover interface is located at x = h. From continuity of the TE field at x = 0 and x = h 
one finds 


A c = A cos Kfh + BsinKfh 


(4.57) 


and 


A,=A 


( 4 . 58 ) 
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j Table 4.1 Asymmetric three-layer waveguide. j 

Refractive index 

Thickness 

n c = 1.40 

_ 

n f = 1.50 

5 pm 

n„ = 1.45 

- 


From continuity of derivatives at the above points 


and 


— Yc^c — —KfA sin Kfh + KjB cos Kfh 

(4.59) 

Y s A s = KfB 

(4.60) 


From Eqs. (4.58) and (4.60) the constants A and B can be expressed in terms of A s . 
Substitution of those expressions into Eq. (4.57) gives constant A c in terms of A s . Using 
those results, we can replace constants A, B and A c in Eq. (4.49). The resulting equations 
are 

E y (x) = A s (cosKfh + ^ sin KfhJ exp [— y c (x — h)] h < x 

E y (x) = A s (cosKfX+ & sinx/x^ 0 <x<h (4-61) 

E y (x) = A s exp (y s x) x < 0 


From the above, the derivatives with respect to x are 

E' y {x) = —A s y c (cosKfh + ^ sinA -ftij exp [— y c (x — A)] h < x 
E' v {x) = A s (—Kf sin/c/ x + y s cos K/x) 0 <x<h (4-62) 

E' y (x) = A s y s exp (y s x) x < 0 

Applying continuity of derivatives at x = h, one finds 

—y c ^cosKfh + — sin K/h^j = —Kf sin k/ h + y s cos kj h 
From the above, it follows the following transcendental equation: 

tan K f h = — Ys + Yc , (4.63) 

Kf ~ YcYs/Kf 

The above equation is used in numerical search for propagation constants. A typical plot 
of Eq. (4.63) is shown in Fig. 4.7. Using those propagation constants, from Eq. (4.61) one 
finds field profiles. 


Example We consider the three-layer asymmetric planar structure described in Table 4.1 
(from [1]) operating with light of wavelength k = 1 pm. 
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1 Table 4.2 Propagation constants for 

a three-layer asymmetric structure defined in Table 1 

Mode 

Propagation constant/J(p.m 1 ) 

TE 0 

9.40873 

TEi 

9.36079 

te 2 

9.28184 

TE 2 

9.17521 



Graphical plot of Eq. (4.63) for the three-layer asymmetric waveguide defined in Table 4.1 . 


The structure is analysed with MATLAB code from Appendix 4A.2.1. The resulting 
propagation constants are summarized in Table 4.2. There is a good agreement with pub¬ 
lished results [1], Field profiles for two modes are plotted in Fig. 4.8. 


4.6 Multilayer slab waveguides: 1D approach 


In this section we discuss slab waveguides consisting of more than three layers. 
Multilayers require the repeated applications of boundary conditions at the layer 
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Field profiles for an asymmetric three-layer structure. Fundamental mode TE 0 (left). Mode TE X (right). 



Multilayer guiding structure. Electric field profile is also shown. 


interfaces. Schematically, the structure is as shown in Fig. 4.9. We make the following 
assumptions: 

• ID problem 

• n = n(x) is the refractive index 



Discussion of TE and TM modes is done separately. We start with TE modes. 
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4.6.1 TE mode 


The initial equations were derived previously, Eqs. (4.32)-(4.34). We write them here for 
easy reference: 

dE y 

— = 

-pE y = ojiM)H x (4.64) 

8H- 

-7 + jPH x = -j(on 0 eoe r E y 

dx 

The above system can be combined together to obtain a wave equation, as 

d 2 E y an r 

—r- = -jco/iQ ^ from first Eq. 

dx 

= —jwfx o (—jPH x - j(osoe r E y ) from third Eq. 

= -jco/x o | —jP ~ j(oso£ r Ey^ from second Eq. 

= P 2 Ey-co 2 fioe 0 s r E y 

Finally, one writes wave equation for ‘i’ layer as 

^E yi (x) = (p 2 - e 0 n 2 ) E yi (x) 
where k% = ixos 0 co 2 and e ri = n 2 . Introduce 

k 2 = kin 2 - p 2 

The solution to the wave equation is 

E yi ix) = A i e- jKi(x - x ‘-' ) + 

At the interfaces of layers, the boundary condition of TE mode are 

Ei(x) \ x=Xi _ t = E t _ iC*)^ 

BEtOe) , _ BEi -1 , 

dx dx ' x ~^-' 

It is more convenient to work with another set of variables U^x) and Vj(x) defined as 


(4.65) 

(4.66) 

(4.67) 

(4.68) 

(4.69) 


Ui(x) = E^x) (4.70) 

Viix) = wfiMx) (4.71) 


From Eqs. (4.64) one obtains equations for new variables. Directly from the first of 
Eqs. (4.64) 


dUjjx) 

dx 


= -JVi(x) 


(4.72) 


Using the second equation to eliminate H x in the third of Eqs. (4.64), one has 


d f + J^E,= 

OX (Of! 0 


—j(OfiQ£oe r E y 
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For layer i — th 


-- ~jkln)E y 


-jp 2 U i (x) = -jk 2 0 n*U i (x) 


- = -jK-UAx) 


Eqs. (4.72) and (4.73) can be written in a matrix form as 

dfewy o -n rwx)] 

dx [Vdx)] 1-jK? 0 J [v,(x)\ 


From Maxwell’s equations one can verify the following relation: 
,dUj(x) dE vi {x) 
j dx j dx 

Using the above relation, the solutions of Eqs. (4.74) are 


- = aJii Q H zi (x) = Vi(x') 


Ui(x) = A ie -^^ Xi) + B,e JKi{x ~ Xi 
dUi{x) 


They can be written in a matrix form as 


') - Bi e jK ‘ ( *- Xl) } 

ES]-C=t]K) 


From the above, for x, one finds 


EH: 'IS 

ives the expression for coeffici 

KMC 4 ]e;] 


Inverting the last equation gives the expression for coefficients in terms of field and its 
derivative 


Substituting the last equation into (4.75) gives 


ru,(x)i r e*<*-*) 1 1 \i ± 1 m-fe)] 

Ld-(x)J L - Kie i«i<*-*d\ 2 \i -4.J Ld-(x,)J 

■ r "KS] 


(4.76) 
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Explicitly, propagation matrix *7\ (x,) at point x, is 


If {xd 


COS Kj(x — Xi) 

—jKi sin Kj(x — xi) 


— sin/q(x - x t ) 
cos #q(x — Xi) 


(4.77) 


Equation (4.76) ‘propagates’ values of the fields from point x, to an arbitrary point x within 
layer i. 


4.6.2 Propagation constant 


A propagation constant is obtained if we propagate fields across all interfaces. First consider 
propagation between two consecutive interfaces, that is propagation from x, location to x,+1. 
One finds from Eq. (4.76) 

sin/Ci(x !+ i - Xi) \Ui(Xi)\ 
C0SX,(X,' + 1 - Xi) JL^fe)J 

Let us define the thickness of layer i as 


p + i(x, + i)l _ T cosx,(x i+ i - Xi) 
\_V i+ ] (x,+i) J I -jKt sin Ki(x i+ \ - xt) 


di=X i+ l -Xi 


The propagation matrix which propagates fields from x, to x i+ \ 
form 


tt = 


COS K t di 
—jKi sin Kjd, 


— d sin Kidi 
COS Kidi 


s called AT) and has the 


(4.78) 


Using the above formalism, we can now propagate fields through the entire structure from 
substrate to a cladding region as follows: 



[mil m n~\ rt/,1 

|_m 2 i m 2 2] 


(4.79) 


In the above we have introduced the 2 x 2 matrix with elements m\\, mn, m 2 \, m 22 
which plays central role in the numerical process of determining propagation constants. 
From the above equation one obtains 


U c = m n U s +m u V s (4.80) 

V c = m 2l U s + m 22 V s (4.81) 


In the substrate and cladding, the fields decay exponentially (Fig. 4.10). Therefore, the 
expressions for fields in the cladding and substrate regions are 
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substrate waveguide cladding 

Schematic of the field distribution. Exponential decay in the substrate and cladding regions is shown. 


• forx < x\ (substrate) 

U s (x) = A s exp[ K s (x-xi)] (4.82) 

V s (x) = jy s A s ex V [y s {x - x x )} (4.83) 

• for x > xn (cladding) 

U c {x) = A c exQ[-y c {x-x N )-\ (4.84) 

V c (x) m —jy c A s exp[—y c (x - x, v )] (4.85) 

In the above expressions we defined 

y- = P 1 - (4.86) 

Yi = M 

From expressions (4.82) and (4.83) one obtains at point xi 

U s (x ,) = A s (4.87) 

VAxi) = jy s A s (4.88) 

In a similar way for point x ; y one has 

U c (xn) = A c (4.89) 

V c (xn) = -JYcA c (4.90) 

Substituting the above results into Eqs. (4.80) and (4.81), one obtains the expressions which 
are used to find propagation constants employing numerical procedure 

ycy s mi2 - rn 2 \ = j(y c mn + y s m 22 ) (4.91) 

Values of matrix elements m\\,m\ 2 , m 2 \,m 22 are obtained numerically for a particular 
guiding structure. 


4.6.3 Electric field 


Once propagation constants are found, one can obtain profile of electric field for each mode. 
To do this one uses Eq. (4.76), which after multiplication of matrices is 


TE/,(x)l = I" cos Ki(x - xd - i sin /c,(x - x,-)l ["L^-fx,-)] 
L v i(x)\ l-jKi sin Kj(x - Xi) cos k, (X - Xi) J \_Vj (x,)J 
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where for layer i x,- < x < x I+ 1. The above determines electric field at an arbitrary location 
within layer i once the values of fields are known at point x,-. In practice one assumes some 
value of electric field at point x\ and using expression (4.92) determines field within all 
layers. The above procedure will be applied to analyse several guiding structures in the next 
section. Before this, we will formulate basic equations for TM modes. 

4.6.4 TM modes 


The analysis for TM modes follows exactly the same path as for TE modes. Here we outline 
basic equations only. For TM modes one has the following relations: 


(4.93) 


For each layer one has 


Define variables 


pH y = coeE x 

d R v . r 
—- = jcoeE z 
3x 

+ jfiE x = jcofiHy 


U = Hy 
V m cosoE z 


They fulfill the following equations: 

dU_ 

dx 




(4.96) 


From this point one follows a similar approach as for TE modes. The remaining analysis is 
left as a project. 


4.7 Examples: 1D approach 


In this section we will consider several waveguiding structures described in literature and 
apply our formalism to analyse those structures. The tests were performed in searching for 
propagation constant ft and then determining the profile of electric field. Our tests were 
performed for several structures analysed in References [2], [3], [4] and [5]. 

We attempted to find all modes. Excellent agreement was found as compared with the 
published results. 
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1 Table 4.3 Geometry of a four-layer lossless dielectric waveguide [ ]. j 

Layer 

Refractive index 

Thickness 

Description 

D 6 

n 6 = 1.00 

1 pm 

Cladding 

d s 

n 5 = 1.66 

0.50 pm 


d 4 

n 4 = L53 

0.50 pm 


d 3 

n 3 = 1.60 

0.50 pm 


d 2 

n 2 = 1.66 

0.50 pm 


Di 

n = 1.50 

1pm 

Substrate 


Table 4.4 Propagation constants for a four-layer lossless dielectric waveguide defined in Table . [ 

Mode 

Propagation constant 

TE 0 

1.622728682325434 + 0.000000000000441/ 

TE\ 

1.605275698094530 + 0.000000000000014/ 

te 2 

1.557136152293222 + 0.000000000000436/ 

te 3 

1.503587112022723 


The list of MATLAB files is provided in Table 4.12 * 1 . All MATLAB files are listed in 
Appendix 4A.3. 


4.7.1 Four-layer lossless waveguide 


The first tested structure is the one considered by Chilwell and Hodgkinson [3] and also by 
Chen et al. [4], It is a four-layer lossless dielectric structure shown in Table 4.3. Wavelength 
is A. = 0.6328 pm. Determined propagation constants for all four modes are summarized 
in Table 4.4. They show an excellent agreement with the results published in Ref. [3] and 

[4]- 

Electric field for fundamental TE-mode has been calculated using mesh points generated 
by function meshjc.m. The resulting profile of TE fundamental mode is shown in Fig. 4.11. 


4.7.2 Six-layer lossy waveguide 


The second structure tested by our program is defined in Table 4.5, from [4], It is a six- 
layer lossy waveguide structure operating at the wavelength X = 1.523 pm. Propagation 
constants for this structure are summarized in Table 4.6. Obtained results show an excellent 
agreement with the published data. 

The TE electric field distribution of the fundamental mode for this structure is shown in 
Fig. 4.12. 

1 I acknowledge the help of my former student, Mr L. Glowacki, in developing some of those routines. 
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I Table4.6 Results of propagation constants for six-layer lossy dielectric waveguide. j 

Mode 

Propagation constant 

TE 0 

3.460829693510364 + 0.072663342917385i 

TE i 

3.316707802046375 + 0.023275817588121i 

te 2 

3.208555428734457 + 0.012782067986633i 

te 3 

3.195490593396514 + 0.012585955654404i 



Field profile of fundamental TE mode for four-layer lossless structure. 


4.7.3 Structure by Visser 


This structure was described by Visser et al. [5], It is summarized in Table 4.7. 

The results of calculations of propagation constants for TE modes are summarized in 
Table 4.8. They are identical to the values published by Visser et al. [5]. The profile of the 
electric field for mode TE\ is shown in Fig. 4.13. 
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Table 4.8 Propagation constants obtained forthe structure shown in Table 


Layer Propagation constant 


TE 0 3.503443332950034 - 0.007103000978683! 

TEi 3.337286858209064 + 0.000229491103731! 

TE 2 3.251685206983383 + 0.000530514779912! 

TE 3 3.104251421414573 - 0.001337986339752! 

TE 4 2.878636779881233 + 0.000173729890361! 

TE 5 2.628139320470185 - 0.001548644353782! 

TE 6 2.243951362601185 - 0.000708377958009! 

TE 1 1.768190960412424 - 0.001353217183860! 

TE a 1.074262026525778 - 0.002457891473570! 



x (microns) 


Field profile of fundamental TE mode for a six-layer lossy waveguide. 


4.8 Two-dimensional (2D) structures 


In the previous sections we have discussed one-dimensional guiding structures, also 
known as planar waveguides. However, in practical devices the waveguides are essentially 
two-dimensional (2D) In those structures, refractive index n(x,y) depends on transverse 
coordinate x and lateral coordinate y. Two examples of popular structures are shown in 
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Field profile of TE X mode for structure discussed by Visser etal. [5], 




(a) (b) 

Popular types of 2D waveguides, (a) Buried-channel waveguide, (b) Rib waveguide. 

Fig. 4.14. They exhibit distinctive features of their refractive index. The structures shown 
are known as the buried channel waveguide and the rib waveguide. 

In what follows we will discuss one of the methods of obtaining approximate solution 
for propagating mode, known as the effective index method. 

4.8.1 Effective index method 


To illustrate the effective index method [6], let us start with a scalar wave equation in 2D 
which is 


d 2 <t>(x,y ) 
dx 2 


3 2 <t>{x,y) 
3 y 2 


+ ^ [e r (x,y) - N 2 ff ^ (p(x,y) = 0 


(4.97) 


Here is the effective index which we want to determine. Assume that there is no 
‘interaction’ between variables x and y. This allows us to separate fields as [7] 


<fi(x,y)=X(x)-Y(y) 

Substitute postulated solution into wave equation, evaluate the relevant derivatives and 
divide both sides of the resulting equation by X(x) ■ Y(y). One obtains 
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Step 1 



The above is equivalent to 

Step 2 



Ni N 2 N 3 


Step 3 


Ni N 2 N 3 

Illustration of the effective index method. 

Introduce separation constant k%X 1 2 3 4 as 

^Y"(y) + ^s r {x,y) = l^X 2 (4.99) 

The remaining terms are 

-^-X"(x) - klN 2 eff = -kp} (4.100) 

A (X) 

Based on the above theory it is possible to design a method, known as effective index 

method to determine effective index IvSy The method (see Fig. 4.15) consists of several 

steps: 

1. Replace the 2D waveguide with a combination of ID waveguides along x-axis. 

2. Solve each of the ID problems separately by calculating the effective index along y axis 
and obtain N e ff in each case (for our structure three cases). 

3. Construct a new ‘effective’ ID waveguide (along x-axis) which will model the original 
2D waveguide. The new effective waveguide has the values of refractive indices as 
indicated. 

4. Determine an effective index by solving the ID waveguide constructed in Step 3 along 
the x-axis. 
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Wavelength 


Refractive indices 


Geometry 


n f = 2.234 
n„ = 2.214 


h = 1.8 pun 
d — 1 p.m 
w — 2 pm 




h 


Structure used as an example of the application of the effective index method. 

An example 

We illustrate the effective index method with an example can be found in [8]. The 
structure is shown in Fig. 4.16 and the numerical parameters in Table 4.9. 

The method proceeds with the following four steps: 

1. Determine normalized thicknesses using the definition of V 



Using the values from Table 4.9, one finds 




One obtains 

N f = 2.227, Nd = 2.218 

3. The same structure along the ^-direction is approximated as the one-dimensional, using 
the previous values. The V number is determined using 
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One obtains 


V eg = 3.14, b eg = 0.64 


(4.102) 


4. Finally, the approximate effective index N = N eq of the equivalent guide is determined 
using 



(4.103) 


For this example, we obtain N = 2.224. 

More examples of the applications of effective index method can be found in books by 
Coldren and Corzine [9] and Liu [10]. 


4.9 Problems 


1. Find the effective refractive indices and the number of TE modes in an asymmetric 
waveguide with an asymmetry parameter a = 10 and thickness h = 1.50 qm. Assume 
A. 0 = 0.82 |um. 

2. Compute the largest thickness that will guarantee single TE mode operation of the 
above waveguide. 

3. Analyse the three-layer, asymmetric guiding structure. Derive a transcendental equa¬ 
tion for the propagation constant. 

4. Conduct analysis of a four-layer waveguiding structure [8], Derive equations for all 
elements of matrix M. Obtain an analytical formula for dispersion relation. 


4.10 Projects 


1. Based on theoretical results developed for the three-layer symmetrical guiding structure, 
write a MATLAB program which will solve transcendental equations using functions 
given by (4.46) and (4.47). Select from the literature an appropriate waveguiding struc¬ 
ture. Use your program to determine propagation constants and profiles of electric fields. 
Compare your results with those from the literature. 

2. Analyse TM modes for the three-layer asymmetric waveguide defined in Table 4.1 . Ob¬ 
tain analytical solutions in all regions and derive an appropriate transcendental equation. 
Write a MATLAB program to determine propagation constants. 

3. Plasmonic waveguide contains a metallic layer. The simplest cases are the three-layer 
structures: metal-dielectric-metal (MDM) and dielectric-metal-dielectric (DMD). Derive 
a transcendental equation describing plasmonic waveguide (similar to Eq. 4.45 for a 
symmetrical dielectric guiding structure). Consult the paper by Kekatpure et al. [11], 
With the equation at hand, write a MATLAB program to determine propagation constants 
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for TM modes for the MDM structure defined in Table 4.10 [11]. Analyse the obtained 
results. 


| Table 4.10 Plasmonic structure. j 

Material 

Thickness (pm) Relative permittivity 

gold 

3 -95.92 - 110.97 

silica 

50 2.1025 

silver 

3 -143.49-19.52 


4. Conduct analysis of the multilayered structure for TM modes. Derive all necessary 
equations. Write a MATLAB program to determine propagation constants and field 
profiles. 


Appendix A: MATLAB listings 


In Tables 4.11 and 4.12 we provide a list of MATLAB files created for Chapter 4 and a 
short description of each function. In Table 4.12 we summarize a list of files used to analyse 
planar waveguides. 


Table 4.11 List of MATLAB functions for Chapter . 

Listing 

Function name 

Description 

4A.1 

bJ' jdiagram. m 

Plots b-V diagram for planar waveguide 

4A.2.1 

aiL.m 

Analysis for three-layer planar waveguide 

4A.2.2 

funcMsym.m 

Function used by a3L.m 



Table 4.12 

Functions used to analyse a 1D planar waveguide. 1 

Listing 

Function name 

Description 

4A.3.1 

slab.m 

driver function (determines prop, constants andfield profile) 

4A.3.2 

lossless.m 

data for waveguide structure without losses 

4A.3.3 

lossy, m 

data for waveguide structure with losses 

4A.3.4 

visser.m 

data for structure described by Visser 

4A.3.5 

muller.m 

implements Muller’s method 

4A.3.6 

f-TE.tn 

creates transcendental equation for TE field 

4A.3.7 

meshjc.m 

generates ID mesh 

4A.3.8 

refindex.m 

assigns the value of ref. index to each mesh point 

4A.3.9 

TEfield.m 

determines TE field for all mesh points 
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Listing 4A.1 Program b_V_diagram.m. MATLAB program to analyse the transverse 
resonance condition (4.12) and to plot a normalized guide index b versus variable V. 
Allows to input different values of the asymmetry parameter a. 


% File name: b_V_diagram.m 

7. function which plots b-V diagram of a planar slab waveguide 
clear all 

N_max = 400; 7« number of points for plot 

b = linspace(0,1.0,N_max); 
hold on 

for nu = [0, 1, 2] 

for a = [0.0, 8.0,50.0] ; 7, asymmetry coefficient 

7. determine V 

VI = atan(sqrt(b./(l-b)) ); 

V2 = atan(sqrt((b+a)./(1.0-b))); 

V3 = 1./sqrt(1.0-b); 

V = (nu*pi + VI + V2).*V3; 

% 


plot(V,b,’LineWidth’,1.2) 
grid on 

axis([0.0 20.0 0.0 1.0]) I change axis limit 

end 

box on 7. makes frame around plot 
xlabel (’V',’FontSize’,14); 
ylabel(’b’,’FontSize’,14); 

set(gca,’FontSize’,14); l size of tick marks 

text(10, 0.96, ’\nu=0’,’Fontsize’,14) 
text(10, 0.8, ’\nu=l’,’Fontsize’,14) 
text(10, 0.55, ’\nu=2’,’Fontsize’,14) 

% 

text(0.1, 0.3, ’a=0’,’Fontsize’,14) 
text(2.5, 0.3, ’a=50’,’Fontsize’,14) 

% 


text(3.3, 0.2, ’a=0’,’Fontsize’,14) 
text(5.8, 0.2, ’a=50’,’Fontsize’,14) 
% 


text(6, 0.1, ’a=0’,’Fontsize’,14) 
text(8.8, 0.1, ’a=50’,’Fontsize’,14) 

close all 


both axes 
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Listing 4A.2.1 Program a3L.m. Determines propagation constants and plots field profiles 
for an asymmetric three-layer planar waveguide. Uses funcMsym.m. 


% File name: a3L.m 
l Analysis for TE modes 

7« First, we conduct test plots to find ranges of beta where possible 
7, solutions exists. This is done in several steps: 

7« 1. Plots are done treating kappa_f as an independent variable 
l 2. Ranges of kappa_f are determined where there are zeros of functions 
7« 3. Corresponding ranges of beta are determined 
7« 4. Searches are performed to find propagation constants 
1 


clear all 

7« Definition of structure 

n_f = 1.50; 

n_s = 1.45; 

n_c = 1.40; 

lambda = 1.0; 

h = 5.0; 

a_c = 2*h; 

a_s = 2*h; 


7. ref. index of film layer 

7. ref. index of substrate 

7. ref. index of cladding 

7. wavelength in microns 

7. thickness of film layer in microns 

7. thickness of cladding region 

7. thickness of substrate region 


k = 2*pi/lambda; 7. wave number 

kappa_f =0:0.01:3.0; % establish range of kappa_f 

beta_temp = sqrt((n_f*k )~2 - kappa_f.~2); 

beta_min = min(beta_temp); 

beta_max = max(beta_temp); 

7. Before searches, we plot search function versus beta 
beta = beta_min:0.001:beta_max; 7« establish range of beta 
N = beta./k; 


ff = func_asym(beta,n_c,n_s,n_f,k,h); 
plot(beta.ff) 

xlabel(’\beta’,’FontSize’,22); 

ylabel(’Search function’,’FontSize’,22); 

ylim( [-10.0 10.0]) 

grid on 

pause 

close all 

7. From the above plot, one must choose proper search range for each mode. 
I Search numbers provided below are only for the waveguide defined above. 
I For different waveguide, one must choose different ranges 
7. for searches. 

betaO = fzero(@(beta) func_asym(beta,n_c,n_s,n_f,k,h),[9.40 9.41]) 
betal = fzero(@(beta) func_asym(beta,n_c,n_s,n_f,k,h),[9.35 9.37]) 
beta2 = fzero(@(beta) func_asym(beta,n_c,n_s,n_f,k,h),[9.27 9.29]) 
beta3 = fzero(@(beta) func_asym(beta,n_c,n_s,n_f,k,h),[9.17 9.18]) 
l Plot of field profiles 
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A_s = 1.0; 

thickness = h + a_c + a_s; 

beta_field = betaO; 7, Select appropriate propagation constant for plotting 
gamma_s = sqrt(beta_field~2 - (n_s*k)~2); 
gamma_c = sqrt(beta_field~2 - (n_c*k)~2); 
kappa_f = sqrt((n_f*k)"2 - beta_field~2); 

% 

“/, In the formulas below for electric field E_y we have shifted 
’/, x-coordinate by a_s 

’/, We also ’reversed’ direction of plot in the substrate region 
NN = 100; 

delta = thickness/NN; 

x = 0.0:delta:thickness; '/. coordinates of plot points 

x_t = 0; 
for i=l:NN+1 

x_t(i+l)= x_t(i) + delta; 
if (x_t(i)<=a_s); 

E_y(i) = A_s*exp(gamma_s*(x_t(i)-a_s)); 
elseif (a_s<=x_t(i)) && (x_t(i)<=a_s+h); 

E_y(i) = A_s*(cos(kappa_f*(x_t(i)-a_s))+... 

gamma_s*sin(kappa_f*(x_t(i)-a_s))/kappa_f); 
else (a_s+h<=x_t(i)) & (x_t(i)<=thickness); 

E_y(i) = A_s*(cos(kappa_f*h)+gamma_s*sin(kappa_f*h)/kappa_f)... 
*exp(-gamma_c*(x_t(i)-h-a_s)); 

% 

h=plot(x,E_y); 

“/, add text on x-axix and y-axis and size of x and y labels 
xlabel(’x (microns)’,’FontSize’,22); 
ylabel(’TE electric field’,’FontSize’,22); 

set(h,’LineWidth’, 1.5) ; “/, new thickness of plotting lines 

set(gca,’FontSize’,22) ; “/, new size of tick marks on both axes 

grid on 
pause 
close all 


Listing 4A.2.2 Function func asym.m. Search function for the asymmetric three-layer 
waveguide. Used by program a3L.m. 

function f = func_asym(beta,n_c,n_s,n_f,k,h) 

7 , Construction of search function for asymmetric 3-layers waveguide 

% 

gamma_c = sqrt(beta.‘2 - (n_c*k)~2); 
gamma_s = sqrt(beta.‘2 - (n_s*k)~2); 
kappa_f = sqrt((n_f*k)~2 - beta.“2); 
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7 . 

denom = kappa_f - (gamma_c.*gamma_s)./kappa_f; 
f = tan(kappa_f*h) - (gamma_s+gamma_c)./denom; 


Listing 4A.3.1 Program slab.m. Driver function which determines propagation constant 
for fundamental TE mode and plots electric field profiles. One-dimensional planar wave¬ 
guide of an arbitrary number of layers is assumed. The list of files is shown in Table 4.12. 

7» File name: slab.m 

% Driver function which determines propagation constants and 
i electric field profiles (TE mode) for multilayered slab structure 
clear all; 
format long 

% Input structure for analysis (select appropriate input) 

7olossless; 

7olossy; 

visser 


epsilon = le-6; 7, 

TE_mode = [] ; 

n_max = max(n_layer); 

zl = n_max; 7. 

n_min = max(n_s,n_c) + 0.001; 7. 

dz = 0.005; 7. 

mode_control = 0; 

7. 

while(zl > n_min) 

zO = zl - dz; 7. 

z2 = 0.5* (zl + zO); 7. 


z_new = muller(@f_TE , zO, zl, 
if (z_new ~= 0) 

7. 

for u=l : length(TE_mode) 

if(abs(TE_mode(u) - z_new) 
mode_control = 1; breal 


numerical parameter 

max value of refractive index 
min value of refractive index 
iteration step 


starting point for Muller method 
starting point for Muller method 
z2); 

verifying for mode existence 

< epsilon) 
c; % mode found 


if (mode_control == 1) 
mode_control = 0; 

else 

TE_mode(length(TE_mode) + 1) = z_new; 


zl = zO; 


% 
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TE_mode = sort(TE_mode, ’descend’); 

7.TE_mode’ 7. outputs all calculated modes 

beta = TE_mode(2); 7. selects mode for plotting field profile 

x = mesh_x(d_s,d_layer,d_c,NumberMesh); 

n_total = [n_s,n_layer,n_c] ; 7. ref index for all layers 

n_mesh = refindex(x,NumberMesh,n_total); 

TE_mode_field = TE_field(beta,n_mesh,x,k_0); 


Listing 4A.3.2 Function lossless.m. Contains data for a lossless waveguide. 


% File name: lossless.m 

7. Contains data for lossless waveguide 

7. Reference: 

% J. Chilwell and I. Hodgkinson, 

% "Thin-films field-transfer matrix theory of planar multilayer 
7, waveguides and reflection from prism-loaded waveguide", 

% J. Opt. Soc. Amer.A, vol.1, pp. 742-753 (1984). 

I Fig.3 

% Global variables to be transferred to function f_TE.m 


% 

global n_layer 

global d_c 
global d_layer 
global d_s 
global k_0 
global NumberMesh 


7„ ref. index cladding 

•/. ref. index of internal layers 

7„ ref. index substrate 

7, thickness of cladding (microns) 

7. thicknesses of internal layers (microns) 
7. thickness of substrate (microns) 

7. wavenumber 

7, number of mesh points in each layer 
7o (including substrate and cladding) 


n_c 

= 1.0; 

n_layer 

= [1.66 1.60 1.53 1.66]; 

n_s 

= 1.5; 

d_c 

= 0.5; 

d_layer 

= [0.5 0.5 0.5 0.5]; 

d_s 

= 1.0; 

NumberMesh 

= [10 10 10 10 10 10] ; 

lambda 

= 0.6328; 7« wavelength 

k_0 

= 2*pi/lambda; 


microns 


Listing 4A.3.3 Function lossy.m. Contains data for a lossy waveguide. 

7, File name: lossy.m 

7. Contains data for lossy waveguide. 

7. Reference: 
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7. C. Chen et al, Proc. SPIE, v.3795 (1999) 


7, Global variables 1 
global n_c 
global n_layer 

global d_c 
global d_layer 
global d_s 
global k_0 
global NumberMesh 


3 be transferred to function f_TE.m 
"/, ref. index cladding 

"/, ref. index of internal layers 

"/, ref. index substrate 

7. thickness of cladding (microns) 

7. thicknesses of internal layers (microns) 
7« thickness of substrate (microns) 

7. wavenumber 

"/, number of mesh points in each layer 
7, (including substrate and cladding) 


d_c 

d_s 

NumberMesh 

lambda 

k_0 


1 . 0 ; 

[3.16455 3.22534 3.39583 3.5321-1j*0.08817 3.39614 3.38327]; 
3.172951; 

1 . 0 ; 

[0.6 1.6 0.518 0.6 0.2 0.1]; 

1 . 0 ; 

[10 10 10 10 10 10 10 10 ]; 

1.523; 7. wavelength in microns 

2*pi/lambda; 


Listing 4A.3.4 Function visser.m. Contains data for a five-layer waveguide described by 
Visser et al. 

7„ File name: visser.m 

7, Contains data for five-layer waveguide with gain and losses 
7. Reference: 

7. T.D. Visser et al, JQE v.31, p.1803 (1995) 

7. Fig.6 

7. Global variables to be transferred to function f_TE.m 

7. 

global n_layer 

global d_c 
global d_layer 
global d_s 
global k_0 
global NumberMesh 

n_c =1.0; 

n_layer = [3.40-li*0.002 3.60+li*0.010 3.40-li*0.002]; 
n_s =1.0; 

d_s = 0.4; 

= [0.6 0.4 0.6] ; 


7, ref. index cladding 
7, ref. index of internal layers 
7, ref. index substrate 
7, thickness of cladding (microns) 

7, thicknesses of internal layers (microns) 
7, thickness of substrate (microns) 

7, wavenumber 

7 0 number of mesh points in each layer 
7. (including substrate and cladding) 
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d_c = 0.5; 

NumberMesh = [10 10 10 10 10]; 

lambda =1.3; % wavelength in microns 

k_0 = 2*pi/lambda; 


Listing 4A.3.5 Function muller.m. This function implements Muller’s method. It is good 
for finding complex roots. 

function f_val = muller (f, xO, xl, x2) 

°i Function implements Muller’s method 

iter_max = 100; ’/, max number of steps in Muller method 

f_tol = le-6; 7. numerical parameters 

x_tol = le-6; 

yO = f(xO); 

yl = f (xl) ; 

y2 = f(x2); 

iter = 0; 

while(iter <= iter_max) 
iter = iter + 1; 

a =( (xl - x2)*(y0 - y2) - (xO - x2)*(yl - y2)) / ... 

( (xO - x2)*(xl - x2)*(x0 - xl) ); 

7. 

b = ( ( xO - x2 )~2 *( yl - y2 ) - ( xl - x2 )~2 *( yO - y2 )) / ... 

( (xO - x2)*(xl - x2)*(x0 - xl) ); 

7. 

c = y2; 


if (a~=0) 

D = sqrt(b*b - 4*a*c); 
ql = b + D; 
q2 = b - D; 

if (abs(ql) < abs(q2)) 
dx = - 2*c/q2; 

else 

dx = - 2*c/ql; 

end 

elseif (b~=0) 
dx = -c/b; 

warning(’Muller method failed to find a root’) 

end 

x3 = x2 + dx; 
xO = xl; 
xl = x2; 
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x2 = x3; 
yo = ylj 
yi = y2; 

y2 = feval(f, x2); 

if (abs(dx) < x_tol && abs (y2) < f_tol) 

’/, Lines below ensure that only proper values are calculated 
if (abs(y2) < f_tol) 
f_val = x2; 

f_val = 0; 


Listing 4A.3.6 Function f TE.m. Using the transfer matrix approach, a transcendental 
equation is created which is then used to determine a propagation constant. 


function result = f_TE(z) 

i Creates function used to determine propagation constant 
•/. Variable description: 

°/ 0 result - expression used in search for propagation constant 
•/. z - actual value of propagation constant 


“/, Global variables: 
“/, Global variables ; 


global n_s “/, 

global n_c “/, 

global n_layer °/ 0 

global d_layer °/ 0 

global k_0 “/, 

/ 


:e used to transfer values from data functions 
ref. index substrate 
ref. index cladding 
ref. index of internal layers 
thicknesses of internal layers (microns) 
wavenumber 


zz=z*k_0; 

NumLayers = length(d_layer); 


°/ 0 Creation for substrate and cladding 
gamma_sub=sqrt(zz~2-(k_0*n_s)"2); 
gamma_clad=sqrt(zz~2-(k_0*n_c)~2); 


°/ 0 Creation of kappa for internal layers 
kappa=sqrt(k_0~2*n_layer.“2-zz.~2); 
temp = kappa.*d_layer; 



102 


Slab waveguides 


% Construction of transfer matrix for first layer 

cc = cos(temp); 

ss = sin(temp); 

m(l, 1) = cc(1); 

m(l,2) = -lj*ss(l)/kappa(l); 

m(2,1) = -lj*kappa(l)*ss(l); 

m(2,2) = cc(l); 

% 

% Construction of transfer matrices for remaining layers 
°/, and multiplication of matrices 
for i=2:NumLayers 
mt(1,1) = cc(i); 
mt(l,2) = -lj*ss(i)/kappa(i); 
mt (2,1) = -lj*ss(i)*kappa(i); 
mt(2,2) = cc(i); 
m = mt*m; 


% 

result = lj*(gamma_clad*m(l,l)+gamma_sub*m(2,2)) . . . 
+ m(2,l) - gamma_sub*gamma_clad*m(l,2); 


Listing 4A.3.7 Function mesh_x.m. Generates a one-dimensional mesh along the 
transversal direction (which is taken as the x-axis) of a dielectric waveguide. 


function x = mesh_x(d_s,d_layer,d_c,NumberMesh) 

°/ Generates one-dimensional mesh along x-axis 
7 Variable description: 

7 Input 

7 d_layer - thicknesses of each layer 

7 NumberMesh - number of mesh points in each layer 
% Output 

7 x - mesh point coordinates 


d_total = [d_s,d_layer,d_c]; 
NumberOfLayers = length(d_total); 
delta = d_total./NumberMesh; 

7 


i thicknesses of all layers 
7, determine number of layers 
7, separation of points for all layers 


x(l) = 0.0; 
i_mesh = 1; 

for k = 1:NumberOfLayers 
for i = 1:NumberMesh(k) 

x(i_mesh+l) = x(i_mesh) 
i_mesh = i_mesh + 1; 


l coordinate of first mesh point 

7 loop over all layers 
7 loop within layer 
delta(k); 
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Listing 4A.3.8 Function refindex.m. Determines the refractive index at all mesh points. 

function n_mesh = refindex(x,interface,index_layer) 

7. Assigns the values of refractive indices to mesh points 
7. in all layers 
7, Input 

7. x() — mesh points coordinates 

7. interface (n) — number of mesh points in layers 

7. index_layer() — refrective index in layers 
7o Output 

7o index_mesh — refractive index for each mesh point 

7. 

7o Within a given layer, refractive index is assigned the same value. 
7o Loop scans over all mesh points. 

7o For all mesh points selected for a given layer, the same 
7o value of refractive index is assigned. 

7. 

N_mesh = length(x); 

NumberOfLayers = length(index_layer); 

7. 

i_mesh = 1; 

for k = 1:NumberOfLayers % loop over all layers 

for i = 1: interface (k) % loop within layer 

n_mesh(i_mesh+l) = index_layer(k); 
i_mesh = i_mesh + 1; 

end 

end 


Listing 4A.3.9 Function TE field.m. Determines TE optical field for all layers. 

function TE_mode_field = TE_field(beta,index_mesh,x,k_zero) 

7, Determines TE optical field for all layers 

7. 

7. x - grid created in mesh_x.m 

TotalMesh = length(x) ; 7, total number of mesh points 

7. 

zz=beta*k_zero; 

7. 

7, Creation of constants at each mesh point 
kappa = 0; 

for n = 1:(TotalMesh) 

kappa(n)=sqrt((k_zero*index_mesh(n))~2-zz~2); 


7. 

7. Establish boundary conditions in first layer (substrate) . 
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7. Values of the fields U and V are numbered by index not by 
7, location along x-axis. 

7. For visualization purposes boundary conditions are set at first point. 
U(l) = 1.0; 

temp = imag(kappa(1)); 
if(temp<0), kappa(l) = - kappa(l); 

’/, The above ensures that we get a field decaying in the substrate 
V(l) = kappa(l); 

1 

for n=2:(TotalMesh) 

cc=cos( kappa(n)*(x(n)-x(n-l)) ); 
ss=sin( kappa(n)*(x(n)-x(n-l)) ); 
m(l,l)=cc; 

m(l,2)=-li/kappa(n)*ss; 
m(2,l)=-li*kappa(n)*ss; 
m(2,2)=cc; 

1 

U(n)=m(l,l)*U(n-l)+m(l,2)*V(n-l); 

V(n)=m(2,l)*U(n-l)+m(2,2)*V(n-l); 

7. 

TE_mode_field = abs(U); 7. Finds Abs(E) 

max_value = max(TE_mode_field); 

h = plot(x,TE_mode_field/max_value) ; 7. plot normalized value of TE field 

7. adds text on x-axix and size of x label 
xlabel(’x (microns) ’ ,’FontSize’,22); 

7. adds text on y-axix and size of y label 
ylabel(’TE electric f ield’, ’FontSize ’ , 22) ; 

set(h,’LineWidth’,1.5) ; 7. new thickness of plotting lines 

set(gca, ’FontSize’ ,22) ; 7. new size of tick marks on both axes 

pause 
close all 
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Linear optical fibre and signal degradation 



In this chapter we will discuss properties of the linear fibre, where refractive index does not 
depend on field intensity. We will concentrate on the following topics: 

• geometrical-optics description 

• wave propagation 

• dispersion in single-mode fibres 

• dispersion-induced limitations. 

Due to dispersion, the width of propagating pulse will increase. In multimode fibres, 
broadening of optical pulses is significant, about 10 ns km -1 . In the geometrical-optics 
description, such broadening is attributed to different paths followed by different rays. In 
a single mode fibre, intermodal dispersion is absent; all energy is transported by a single 
mode. However, pulse broadening exists. Different spectral components of the pulse travel at 
slightly different group velocities. This effect is known as group velocity dispersion (GVD). 

5.1 Geometrical-optics description 


Consider an optical fibre whose cross-section is shown in Fig. 5.1. The corresponding 
change in refractive index profiles is shown in Fig. 5.2. We illustrate two profiles: step 
index and graded index. 

In the following discussion we will assume validity of a geometrical optics description 
which holds in the limit A < a, where A. is the light wavelength, and a is the core radius 

[1], P]. 


5.1.1 Numerical aperture (NA) 


To start our discussion, consider the propagation of rays entering a cylindrical fibre at 
different angles in the plane passing through the core centre, see Fig. 5.3. Ray B will travel 
in the cladding region and it is known as an unguided ray, whereas ray A will stay within 
core region, and it will form a guided ray. 

There exists an angle 9 a (known as the acceptance angle) such that all rays incident from 
a region with refractive index n 0 at angles fulfilling the relation 


will stay within a core region (such as ray A). All rays incident at the acceptance angle 0 a 
will travel at the boundary between core and cladding. 
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Typical profiles of optical fibre. Linear step (left) and graded (right). 



The numerical aperture (NA) is defined as 

NA = n 0 sin 6 a 

It is a dimensionless quantity. Typical values of NA are within the 0.14-0.50 range. Eval¬ 
uation of NA follows exactly the same procedure as described in Chapter 4. The final 
expression is 

NA = rh (2A) 1/2 (5.1) 

In practice n\ n 2 and the relative refractive index difference A is 

A _ (ni + n 2 ) (ni - n 2 ) ^ 2n t (m - n 2 ) _ ni~n 2 
2 n\ 2 n\ n\ 


A typical value of parameter A for a single-mode fibre is around 0.01. 
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5.1.2 Multipath dispersion 


A very important physical effect originates from the fact that different rays travel along 
paths of different lengths. The shortest path, for incident angle 9\ = 0 is equal to L, fibre 
length; the longest path, for 6\ = 0 max (which corresponds to 4>c) is for rays travelling the 
distance Time delay between both paths is AT. It can be estimated as follows: 

A distance 



where we have used v = for speed of light in fibre. Time AT is a measure of broadening 
of an impulse. 


5.1.3 Information-carrying capacity of the fibre 


Next, we will estimate the information-carrying capacity of the fibre. If we introduce B as 
a bit rate, and L fibre length, then the measure of the information-capacity of the fibre is 
determined by a product B ■ L. 

The estimation of this product is as follows. Let T B where T B = \ be the time allocated 
for each bit slot. We use the condition 


AT < T b 


(5.4) 


which requires that pulse broadening is smaller than time slot allocated for each bit. Let us 
substitute the previous expression for AT, and we obtain 
Ln\ 

-^-A B < 1 or 
c« 2 


LB 


ri2 c 

Wa 


(5.5) 


Example Assume n\ = 1.5, ni = 1.0. From the above, one finds L ■ B < 0.4^ • km. On 
the other hand, for A = 2 x 10 -3 => 5 ■ L < 100 ^ ■ km. This means that one can carry 
data of 10 Mb s -1 over 10 km. 


5.1.4 Loss mechanisms in silica fibre 


Attenuation (losses) of optical signal is a maj or factor in the design of optical communication 
systems. Losses can occur at the input coupler, splices and connectors but also within the 
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wavelength (microns) 


Spectrum of losses in glass. Adapted from [3] with permission from the Institution of Engineering and Technology. 


fibre itself. In this section we will concentrate on losses in the fibre. A typical spectrum of 
losses in modem fibre is shown in Fig. 5.4. Losses are usually expressed in decibels per 
unit of distance (as was mentioned in Chapter 2) according to formula 

10 Pouf 

ocioss = L logm p ( 5 - 6 ) 

where P in and P mt are, respectively, the input and output optical powers for a fibre of 
length L. 

Loss mechanisms are classified as [4]: 

• intrinsic which arise from the fundamental material properties of the glass. These can be 
reduced, e.g. by appropriate choice of wavelength 

• extrinsic which result due to imperfections in the fabrication process. Sources of extrinsic 
loss include impurities or structural imperfections. 

5.1.5 Intrinsic loss 


In pure silica, the three sources of loss important at visible and near-infrared wavelengths 
are: 

• ultraviolet absorption 

• infrared absorption 

• Rayleigh scattering. 

All of the three mechanisms are wavelength dependent. The first two involve two reso¬ 
nances centred in the ultraviolet and mid-infrared, see Fig. 5.4. They originate due to strong 
electronic and molecular transition bands. The ultraviolet resonance is of electronic origin 
and is centred near )■, = 0.1 pm [4], The infrared absorption originates from lattice vibra¬ 
tional modes of silica and dopants. The vibrational modes produce absorptive resonances 
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centered between 7 pm and 11 pern. The resonances for silica and germania occur at 9 pun 
and 11 pun, respectively [4], Intraband absorption results in a power loss approximately 
given by 

a ir = A exp [dB km -1 ] (5.7) 

where A is in micrometres. For GeOi — SiC>2 glass, the values of A and a lr are A — 
7.81 x 10 11 dB km" 1 and a ir = 48.48 pun [4], 

Rayleigh scattering in pure fused silica can be approximated by the expression 



where A. is in micrometres and loss «« is in dB km -1 . 

Rayleigh scattering appears when a wave travels through a medium having scattering 
objects smaller than a wavelength [5], In a glass those small objects are created as lo¬ 
cal variations in refractive index in otherwise homogeneous material. Local variations of 
refractive index in turn can be due to localized variations in a material’s density. 

5.1.6 Extrinsic loss 


These losses are not associated with the fundamental properties of material. There are 
many sources of extrinsic losses, for example presence of additional impurities and surface 
irregularities at the core-cladding boundary. Some other related losses are due to bending 
and source-fibre coupling. 


5.2 Fibre modes in cylindrical coordinates 


Fibre exhibits cylindrical symmetry, see Fig. 5.5. In the cylindrical coordinate system we 
use the following variables: r, <fi, z. Refractive index is expressed as (see Fig. 5.1) 


n = 


(5.9) 


where a is the radius of the central region known as core. 
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5.2.1 Maxwell's equations in cylindrical coordinates 


In cylindrical coordinates, one has the following general relation for arbitrary vector A [6]: 

i a r '"a,/, a- 

Vy a _ : I A. A 

VXA ~ r dr 34> 3z 

A r rA ^ A z 

^fdA z 9A 0 \ _ fdA r dA z \ _ira, . BA r I 

= *' W -17 ) + - H7> +a -r [at K) - wJ 

where a r , a r/ ,, % are unit vectors in the corresponding directions. Vector A is expressed as 
A = a r A r + a^ + a Z A Z 

Assuming e Jcot dependence for fields, the first Maxwell equation in cylindrical coordinates 


BE, 8E Z . tj 

~ 


i r a , „ v 8E r i 


(5.10) 

(5.11) 

(5.12) 


Assuming additionally e J ^ z dependence, one finds 
1 dE, 

rlhj) + ^ E<t> = ~J wllHr 
dE z 

jfiEr + dr = jcoiiH^, 

i r a , . 8ea 
r[* { t) ~H\ = 

Similarly, from the second Maxwell equation we obtain 

(5.13) 

(5.14) 

= <5 - 15) 

Expressing//,, from (5.10) and (5.14) and comparing both expressions, one obtains for 

r j (P8E Z 3H Z \ 

= <5 ' 16) 


1 dH. 

= J ° JSE ' 


q = co eh ~ P 

= nX~P 2 


(5.17) 
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Similarly, using the same procedure, from (5.11) and (5.13) one obtains for E r 
i ( dE. coudH z \ 

In an analogous way one finds expressions for Hq and H r : 

Equations (5.16), (5.18), (5.19) and (5.20) express relevant components of E and H fields in 
terms of E- and H z components. In the next step, we derive equations for those components. 


5.2.2 Wave equations in cylindrical coordinates 


From the analysis above we can observe that the z-component of the electric field E z (and 
same applies also to magnetic z-component H z ) can be considered an independent variable 
and that the remaining components can be expressed in terms of E z . (The same is true for 
H z .) Hence, we only need to work with E z (or H.). For that purpose we will derive the 
appropriate wave equation from which we can obtain a solution for E z . Once we have a 
solution for E- we will use it to find E r and Eq. In this way, all components of the electric 
field within a circular waveguide are derived. 

To derive the equation for E z we start with Eq. (5.15). Replacing derivatives on the left 
hand side by expressions (5.19) and (5.20) and rearranging terms, one finds 


d 2 E z 

~d^ 


1 dEr 1 d 2 E z 2 


(5.21) 


where q 2 is given by Eq. (5.17). 

Similarly, we can derive an equation for H z . For the last step we will use Eq. (5.12). We 
replace Eq and E r and obtain a wave equation for II Z in cylindrical coordinates 


d 2 H z 1 dH z 
dr 2 r dr 


1 d 2 H z 


+ n 2 k%H z = 0 


(5.22) 


5.2.3 Solution of the wave equation in cylindrical coordinates 


Here, we will analyse the equation for E z . An equation for H- can be solved in a similar 
way. Assume that the following separation of variables is possible: 

E z (r,<P)=R(r)ct>( ( t)) (5.23) 


After separation, one obtains two equations 


rf 2 ch(d>) 

d(f> 2 


m 2 d>«>) = 0 


(5.24) 
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2 d 2 R dR , 2 2 


where m 2 is a separation constant. Eq. (5.25) is known as the Bessel equation and its 
solutions are known as Bessel functions [7]. The solution of Eq. (5.24) is 


0(0) = e ,mtl> 


(5.26) 


Here, m takes integer values because 0(0 + 27r) = 0(0). 

The Bessel equation will be written and solved separately for core and cladding regions. 
In the core region it reads 


2 d 2 R dR , 2 , ,, 

+ -") S = ° 


(5.27) 


where 


k 2 =n 2 l l$-p 2 >0 


(5.28) 


The solutions of this equation are known as ordinary Bessel functions J m and N m of the first 
order and are shown in Fig. 5.6. 

The Bessel equation for the cladding region is 

r2 S +r S + ^ /2 +m2 ) R = 0 < 5 ' 29 ) 

where 

y 2 = f} 2 -n 2 k 2 > 0 (5.30) 


The solutions of this equation are known as modified Bessel functions, K m I m of the second 
kind and of order m. Those functions are plotted in Fig. 5.7. 

MATLAB codes for obtaining plots of Bessel functions are provided in Appendix 5C.1. 
Using the above results for solutions of Bessel equations, one can obtain the solutions for 
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core and cladding regions as [4] 

AJ m ( Kr ) + A'Y m {icr), r < a 
CK m (yr) + C'I m ( yr ), r > a 

where A,A',C,C are constants. 

We expect the following properties of guided modes: 



(5.31) 


1. the solution in the core will be oscillatory and finite at r = 0 

2. the solution in the cladding will decrease monotonically as radius increases, i.e. 
field 0 for r ->• oo. 


The above properties imply that for core the J m function should be used and since Y m has 
a singularity (see Fig. 5.6), for r = 0 we set A' = 0. Similarly for cladding the function I m 
increases (see Fig. 5.7) and we set C' = 0. In cladding we use only K m . 

In this way one finds the solutions 


jAJ m (/cr)e im ^, r < a 

Z ~ [CK m (yr) r>a 

In exactly the same way 

f BJ m (Kr) r < a 

\DK m (yr)e mKl \ r>a 


(5.32) 


(5.33) 


Employing Eqs. (5.16), (5.18), (5.19) and (5.20), one can express the other field compo¬ 
nents in terms of E z and H z . After a little algebra, the full solutions are: 

Case: r < a (core) 

E < f, = ^ ^ j—mAJ m ( Kr ) (/cr)^ e' m<l> (5.34) 

E r = (P* A JL (Kr) + j—mBJ m (/cr)) ^ (5.35) 




(ws\icAJ' m (i Kr ) + j^mBJ„, (/cr)j e J ' n<p 


(5.36) 
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H ' =-^2 (P KBJ ' m (* r ) ~ j~~mAJm Ocr)) e jm,l> 
Case: r > a (cladding) 

E 4 , = ^ (j^mCK m ( yr ) -w^yDK' m (yr)^ e> m * 
E r=y 2 ( PYCK' m (yr) +j°^mDK m (yr)) e** 
H 4 = -4 ^ft>e 2 yC^; (yr) + j^mDK m (yr) j e’ rmt ’ 


H r = - 7 2 (pyDK’ m (yr) - j°^mCK m (yr)) e /m0 
where we have introduced the following definitions 


•4 (*r) = 


dJ m (nr) 
d(Kr) 


and 


K (yr) = 


K m (yr) 
d(yr) 


5.2.4 Boundary conditions and modal equation 


(5.37) 

(5.38) 

(5.39) 

(5.40) 

(5.41) 


We use boundary conditions derived previously which require that all field components 
tangent to the core-cladding boundary at r = a must be continuous across it. We therefore 
have [8] 

E z , H z , E,p, fl 4> are continuous at r = a (5.42) 


Applying the above requirement, we obtain the following equations: 

1. from continuity of E z 

AJ m (tea) = CK m (ya) 


2. from continuity of EI Z 

3. from continuity of E$ 


BJ m (ica) = DK m (ya) 


^ J m (ko)A + j—J' m (tea) B + ^-K m (ya) C + j—K' m (ya) D = 0 


(5.43) 

(5.44) 


(5.45) 


4. from continuity of 

- j—J' (Ka)A + ——^J m (<ca) B — j^^K' (ya) C + ^—^K m (ya) D = 0 (5.46) 
k uk * 1 2 3 4 y ay 2 
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The above form a set of simultaneous equations for unknown coefficients A, B,C, D. The 
nontrivial solution is obtained if the determinant vanishes, i.e. 

J m {tea) 0 -K m (ya) 0 

0 J m (tea) 0 -K m (ya) 

J m (tea) Oca) (ya) j^^K' (ya) = 0 (5.47) 

K z a k y 2 3 a y 

.(os i . , fim ,coe 2 ^ Pm 

~J - J m (xa) —J m (Ka) -j - K m (ya) — T K m (ya) 

k an * 1 y ay 

Expansion and evaluation of this determinant is described in Appendix 5B. The resulting 
characteristic equation is 

(ko) k$n 2 2 K' m (ya) 1 _ ph 

m (xa) yK m (ya) J 

(5.48) 


J’ m (xa) 

[ kJ m (ko) ' yK„ 


■’ m (ya) 1 f k^n\f m (I 
K m (ya) J[ KJ m (Kt 




This equation forms the basis for obtaining propagation constants for guided modes prop¬ 
agating in cylindrical fibre. Before we start a detailed discussion, let us look at the general 
classification of modes. 


5.2.5 Mode classification 


Variable m is the main parameter used in mode classification. It is known as azimuthal 
mode number. Because of the oscillatory behavior of J m (ko) for a given value of m as 
seen in Figs. 5.6, multiple solutions exist for each integer value m. We denote them as P„ m , 
where n = 1,2,... Index n is called the radial mode number and it represents the number 
of radial modes that exist in the field distribution [4], 

In general, electromagnetic waves in a step index optical cylindrical fibre can be divided 
into three distinct categories [9]: 

1. Transverse electric (TE); sometimes designated as magnetic (H) waves. They are char¬ 
acterized by E 2 = 0 and H z ± 0. 

2. Transverse magnetic (TM); sometimes designated as electric (E) waves. They are char¬ 
acterized by E 2 ^ 0 and H z = 0. 

3. Hybrid waves. Characterized by E z ^ 0 and H z ± 0. 

In more detail, the following cases are treated separately [8]: 

1. m = 0. Still, solutions are separated into two groups: 

a. TMo„ (transverse magnetic) because E^, = H r = H- = 0. In this case coefficients 
B = D= 0. 

b. TE 0n (transverse electric) because = E r = E z = 0. In this case coefficients 
A = C = 0. 

2. m ± 0. Hybrid modes. 

The modes are called EH or HE modes (terminology has historical roots traced to 
microwaves). 
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Values of ft will correspond to modes which have finite components of both E z and H-. 
There are neither TE nor TM modes. General classification is [10], [11], [12], 

if A — 0 the mode is called TE mode 
if B — 0 the mode is called TM mode 
if A > B — 0 the mode is called HE mode ( E z dominates H) 
if A < B = 0 the mode is called EH mode ( H z dominates E-). 

Now, we will discuss some of those cases in more detail. 

5.2.6 Modes with m = 0 


The simplest solutions of characteristic Eq. (5.48) are obtained for m = 0. In this case there 
is no angular dependence on cp, and therefore field components E z and H z are rotationally 
invariant. The characteristic equation is reduced into two equations which describe TE and 
TM modes: 

TE modes 


Jq(kci) Kq (ya ) 
kJ 0 ( Ka ) yK 0 (ya) 


(5.49) 


TM modes 


J'o (Ka) ^ n\ K’q (ya) 
kJ 0 (ko) n] yK 0 (ya) 


(5.50) 


Using the following relations for Bessel functions, (see Appendix 5A) 


■/'(») _ J-i(u) ^ W' (io) _ K_i (w) 
Jo (u) Jo (u) wK 0 (w) wK 0 (w) 

and J-\ = (— 1)'J|, K_\ = K\ , one obtains 


J\ (u) u K\ (u>) _ 
Mu) wK 0 (w) = 

Jliu) n\ u K\(w) 
Jo(u) n\wK 0 (w) 


(5.51) 


(5.52) 


where we used definitions u = aK and w = ay. 

The interpretation of relations (5.51) and (5.52) as describing TE and TM modes comes 
from the following observations. 

From continuity equations for E. and H z (established in the previous section), one can 
express constants C and D in terms of A and B. Substitute these results into the continuity 
equation for and obtain 



a 


(h 


+ ja>H 


' J' m (Ka) 
KJ m (ko) 


K' m (ya) ] 
yK m (ya) \ 


B = 0 


When the term in the square bracket is zero, then A = 0 and because of Eq. (5.32), 
E z = 0 which means that electric field is transverse, i.e. TE mode. Similarly, substituting 
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expressions for constants C and D into continuity equation for one obtains 


[ 2 J'Jica) 
y x tcJ m ( Ka ) 


;j 2 K(ra) l 1 Prn 
2 yK m (ya)\ 


B = 0 


When the term in the square bracket is zero, then B — 0 and because of Eq. (5.33), H z = 0 
which means that magnetic field is transverse, i.e. TM mode. 


Interpretation 

Here, we will provide an interpretation of relations (5.51) and (5.52) which describe TE 
and TM modes. 

1. TM modes 

Assuming B = D = 0 (and also m = 0), from Eqs. (5.34)-(5.37) in the core region one 
finds 

E ( „ = 0, E r = —~PAJq (jcr) 

= - —o)£\AJq {kv) , H r = 0 
Also, from (5.32) and (5.33) one has 

E z = AJ 0 ( ter ) , H z = 0 

For such modes longitudinal magnetic component H z is zero and it is therefore called TM 
mode (transverse magnetic). Designation of such modes is TMq„. 

2. TE modes 

In a similar way, taking A = C = 0 (and also m = 0), from Eqs. (5.34)-(5.37) in the 
core region one obtains 

E ^ = -coiiBJq ( Kr ), E r = 0 
T7 0 = 0, H r = --PBJq (ter) 
and also, from (5.32) and (5.33) one has 

E z = 0, H z = BJ 0 ( Kr ) 

For such modes longitudinal electric component E z is zero and it is therefore called TE 
mode (transverse electric). Designation of such modes is TEo„. 

A summary of mode properties for m = 0 is provided in Table 5.1. 

5.2.7 Weakly guiding approximation (wga) 


General solution of the dispersion Eq. (5.48) is very complicated. Gloge [13] developed an 
approximation known as the weakly guiding approximation (wga). It is based on the fact 
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Table 5.1 Classification of modes for m = 0. 


description 

Values of 

Equation coefficients 

Nonzero field 
components 

Zero field 

components 

TE 0n 

^ = c = 0 

uJ 0 («) yKo (w) 

4,4,4 

E z = 4 = 4 = 0 

TM 0 „ 

JUn) n\ KL(w) 

0 W + 2 V) ^ — 0 B = D = 0 

uJq (u) n\ yK 0 ( w ) 

4,4,4 

4,4 ,4 = 0 


that«i — «2 4C 1. Under such approximation n\~n\ = n 2 and also fi 1 ~ k^n 2 . Introducing 
definitions u = tea and w = ya, general Eq. (5.48) reduces to 


One can observe that two possible cases exist. We will consider them separately. 


+ K' m (w) =±m (]_ + J_ 
uJ m ( u ) wK m ( w ) \u 2 w 2 


(5.53) 


Case *+’: 

Applying properties of Bessel functions (see Appendix 5A) given by Eqs. (5.85) and (5.87), 
from Eq. (5.53) one obtains 


Jm+du) | K m+l (w) 
uJ m (u) wK m (w) 


(5.54) 


Case 

Applying properties of Bessel functions (see Appendix 5A) given by Eqs. (5.84) and (5.86), 
from Eq. (5.53) one has 


■4-1 (u) K m _ i ( w ) 
uJ m ( u) + wK m (w) 


(5.55) 


Eigenvalue Eq. (5.53) is solved numerically to find propagation constant jB. In general, 
it has multiple solutions for each m. They are labelled by n (n = 1,2,...). Thus ft = p mn . 
Each value of corresponds to one possible mode. Fibre modes are referred to as hybrid 
modes and denoted by [4] 


HE mn (dominates// z ) 
EH mn (dominates/?;) 


In special case m = 0 


HE 0n = TE 0n 1 
EH 0n = TM 0n \ 


correspond to transverse electric and transverse magnetic 


LP modes 

LP modes originate in a weakly guiding approximation. Those approximate solutions are 
much simpler than the exact solutions. Solving characteristic equation leads to similar 
eigenvalues for both the EH m _ [n modes and HE m+ \ n [9], If the fields of these modes are 
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combined, the resulting transversal field will be linearly polarized. The linearly polarized 
modes are labelled as LP m n . They are composed in the following way: 

LP 0n = HE ln 
LP\ n = HE ln +E 0n +H 0n 
LP m n = HE m+ \ n + EH m _m> 2 

For each n there are two LPq„ modes, polarized perpendicular to each other, while for each 
n and each m > 0 there are four LP m n . Here 


LP mn (LP stands for linearly polarized) 


At the end, let us summarize some important definitions: 



which is known as mode index or effective index, and also 

V = koa ( rin\ - r,& ^-dhisflK 

where 



n\ 


and finally 



«t - «2 «i - «2 


To finish this section, consider an example. 


(5.56) 


Example Typical values (for multimode fibres) are a = 25 pm, A = 5 x 10 -3 , V ^ 18, 
at a = 1.3 pm. Using approximate expression M = V 2 /2, where M is the total number of 
modes (see Problem 1), one finds out that this particular fibre supports approximately 162 
modes. 


5.2.8 The unified expression 


Equations derived so far for TE, TM, EH and HE modes in the weakly guiding approxi¬ 
mation can be written in the form of a single equation. One should notice that in the wga, 
equations describing TE and TM modes degenerate into a single equation. The Eq. (5.55) 
for HE modes requires modification. First, write Eq. (5.55) as 


uJ m ( u ) wK m ( w ) 
J m -1 («) + K m -1 (w) 


(5.57) 


In order to appropriately modify the above equations, consider general formulas for Bessel 
functions provided in the Appendix, Eqs. (5.79) and (5. 80). By changing index as m +1 —► m 
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in those equations, one obtains 

uJ m (u) = 2{m - l)J m -i(u) - uJ m - 2 {u ) 
wK m (w) = 2(7 n - 1 )K m _\ (w) + wK m _ 2 (u) 


The above equations can now be substituted into (5.57) and finally obtain 
uJ m -2 (m) wK m -2 (w) 

J m (u) = K m {w) 


(5.58) 


Equations (5.54) and (5.58) can now be written as a single equation if we define new index 
k as 

I I for both TE and TM modes 

7M+1 for EH modes 
777 — 1 for HE modes 


The universal equation reads 


uJk -1 ju) 

Mu) 


uiKk-i (w) 
K t (w) 


(5.59) 


5.2.9 Universal relation for fundamental mode HE\ \ 


Fundamental mode HE\\ (or LPq\) has the following dispersion relation (corresponding to 

777 = 1): 


JUw) M Ki(w) 
J Q (u) + wK 0 (w) 


(5.60) 


where u = a^n\kl — f)' 


and w — c 


and 


P 2 # 2 -»2*o 

n\ - n\ n\k 2 0 - n 2 k 2 
V = k 0 a(nl-nl) l/2 


From these definitions, one finds 



After a little algebra 

u 1 = a 2 {n\k%, — P 2 ) = a 2 n\k% — a 2 n\k^ + a 2 n\l^ — a 2 p 2 
= v 2 -w 2 = V 2 - V 2 b 


The arguments in Eq. (5.60) can therefore be expressed 
u = Vs/T^b 


and 


= vVb 
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1.5 2 2.5 


Plot of the waveguide parameter b and its derivatives d (JbV)/dV and Vd 2 (bV)/dV 2 as a function of the V 
numberfor//£n mode. 


Replacing arguments in Eq. (5.60) with the above relations, we finally obtain 

MrVl^b) v* Xo(FVS) 

The above formula holds for mode HE\\. It contains only two variables V and b. It is 
therefore called a universal relation. In Appendix 5C.2 we provided MATLAB program 
which plots b = b(V) relation and also two other functions ‘pp and v p^P which are 
important in determining dispersion relation for propagating mode. Those relations are 
shown in Fig. 5.8. 


5.2.10 Single-mode fibres 


The single-mode fibre supports only HE] \ mode (fundamental mode which corresponds to 
m = 0). From Eq. (5.48) we have 

KJo(ica)K' 0 (ya) + yJo(ya)Ko(ya) = 0 
KnjJo (ica)K' 0 (ya) + yn\j^ (ya)K 0 {ya) = 0 

At cutoff, b = 0 which gives that f }/kq = N and that y = 0. Also, K 2 a 2 = a 2 ^ (n 2 — n\ ). 
Thus V = tea when y — 0. 

Thus cutoff condition is 

Jo ( V) = 0 (5.62) 


which gives (for single mode) 


V = 2.405 


( 5 . 63 ) 
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Plot of Bessel functions for /w = 0 and m = 1 used to illustrate cutoff conditions. 


Example Estimates of a. 

Assume X = 1.2 pm, n\ — 1.45 and A = 5 x 10 -3 . For those values, the above condition 
for V gives a < 3.2 pm. 


5.2.11 Cutoff conditions 


The condition b = 0, which results in /l 2 = kpi\, defines the so-called cutoff of the mode 
[14], A mode is cut off when its field in the cladding becomes evanescent and is detached 
from the guide. At cutoff = k^h 2 and also b = 0, V = V c . The situation is illustrated in 
Fig. 5.9 for m = 0 and m — 1. Case m = 0 corresponds to modes TE 0 „ and TM 0 „. The 
cutoff conditions for those modes are obtained from roots of the following equation: 

J 0 (K c a) = 0 (5.64) 

Cutoff conditions are illustrated in Fig. 5.9. For m = 0 the few lowest roots are shown. They 
correspond to K c a = 2.405 and K c a = 5.52. For m = 1 the lowest cutoffs are: K c a = 3.83 
and K c a = 7.02. 


5.3 Dispersion 


Dispersion is an effect responsible for spreading of a signal in time when it propagates. 
Signal is represented by pulse, so effectively we talk about pulse broadening. 

Transmitted pulses broaden when they propagate in dispersive media. After long propa¬ 
gation it might be impossible to distinguish individual pulses, see Fig. 5.10. 
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(a) (b) 

The effect of dispersion on short temporal pulses, (a) Before entering dispersive medium, (b) After propagation in a 
dispersive medium. 

The main types of dispersion are [10]: 

1. Material dispersion. 

It exists when refractive index n(k) depends on wavelength. As a result, different wave¬ 
lengths travel with different velocities. 

2. Modal dispersion. 

Exists in waveguides with more than one propagating mode. Each mode propagates with 
different velocity. 

3. Waveguide dispersion. 

Exists because propagation constant ft depends on the wavelength, so even for n(k) = 
const different wavelengths will propagate with different speeds. Usually the smallest, 
but important close to the wavelength where dispersion is zero. 

5.3.1 Group delay-general discussion 


In a fibre, different spectral components of the pulse travel at slightly different group 
velocities. They produce an effect known as group-velocity dispersion (GVD), also known 
as intermodal dispersion or fibre dispersion. The name used depends on what description 
is used (geometrical optics or wave optics). 

Detailed discussion is based on the following assumptions: 

• optical signal excites all modes equally at the input of the fibre 

• each mode contains all spectral components 

• each spectral component travels independently (in the fibre) 

• each spectral component undergoes a time delay or group delay. 

Transit time of a pulse travelling through the fibre of length L is 


L 


(5.65) 


where r g - time delay or group delay, L - distance, v g - group velocity, ft - propagation 
constant. 

Group delay determines the transit time of a pulse travelling through the fibre of length 
L. Group velocity is defined as 


1 _ dp 
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Typically, one can expresses r g in terms of derivatives with respect to A: or A. or V. For that 
purpose we will summarize several relations: 


co = 2nv, v = p £=^-, c0 = kc, V = («i — nf) 172 • a- k 
From the above relations, one finds the following useful formulas: 


d _ d dk _ dk dx dk __2n d _ _ X 1 d 
dco cdk’ dX dX X 2 ’ dk 2n dX 


(5.67) 


To evaluate derivative with respect to V, one observes that ft = ft (k(V) ). Therefore 
d^ _ dp_dV_ _ VcW 
~dk ~ dV~dk ~ ~kdV 

where we applied the following result obtained using expression for V: 


dV _ V 
dk k 


Using the above relations allows us to express r g in several forms 


= L dl = Ldl = LVdl = _LX^dl 
X% dco c dk c k dV 2 jrc dX 

Because r g depends on A., each spectral component of any particular mode travels the same 
distance in different time. 

At this stage, a typical procedure is to explicitly consider two main physical mechanisms 
which contribute to group delay r g . One can therefore write 


Ldp I Ldp I 

c dk |^ co „ s , ^ c dk | H=C 


(5.69) 


where x mat is known as material dispersion whereas r wg is a waveguide dispersion. 

Material dispersion accounts for the fact that refractive index depends on wavelength; i.e. 
n = n(X). If we neglect dependence of refractive index on wavelength, i.e. set n = const, 
then due to the fact that group velocity depends on X one obtains the contribution known 
as waveguide dispersion. We now consider both contributions separately. 


5.3.2 Material dispersion: Sellmeier equation 


Material dispersion originates from the fact that refractive index n(X) is wavelength depen¬ 
dent. Propagation constant fi is 



Use the above and Eq. (5.68) to obtain 



(5.70) 
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wavelength (pm) 

Wavelength dependence of refractive index based on the Sellmeier equation for Si0 2 . 


Let us concentrate now on material dispersion in silica. In practice, very often an empirical 
expression for the refractive index of glass in terms of the wavelength X is used. It is known 
as the Sellmeier equation and it was introduced in Chapter 2. Its form is 


(5.71) 


Here G\,G 2 , G 3 and a, , X 2 , X 3 are constants (called Sellmeier coefficients) which are 
determined by fitting the above expression to the experimental data. Often in literature 
more elaborate expressions are used. Using data provided in Table 2.2, we plotted in 
Fig. 5.11 a refractive index for Si0 2 as a function of wavelength. MATL AB code is provided 
in Appendix 5C.3. 


5.3.3 Waveguide dispersion 


Waveguide dispersion exists because group velocity v g depends on the wavelength. The 
expression for waveguide dispersion x wg is expressed in terms of quantity b introduced 
earlier, so we start with the definition of parameter b 

b _ P 1 /k 2 ~ n\ ^ P/k - n 2 
n\ -n\ «i - «2 

where we used the fact that f>!k~n\ sa n 2 . We use the above equation to determine P as 


= kn 2 {\ + bA) 


(5.72) 


Here A = Waveguide dispersion is obtained by evaluatind derivative dp/dk using 
Eq. (5.72) and assuming that n 2 is k independent. One finds 


dp 

dk 







Pulse dispersion during propagation 


Substitute the above result into general definition (5.69) and have 


L 


+ «2 A 


d{Vb) \ 
dV ) 


(5.73) 


5.4 Pulse dispersion during propagation 


Because r g depends on A, each spectral component of any particular mode travels the same 
distance in different time. As a result, optical pulse spreads out with time. Let AX be a 
spectral width of a source. Each wavelength component within AA will propagate with 
different group velocity, resulting in a temporal broadening of a pulse. Pulse broadening 
Ar is expressed as 

At AX (5.74) 

dX 

Assuming only material dispersion, i.e. x = x mat and using previous result (5.70) we have 
dx ma t L | dn dn d 2 n 1 L d 2 n 

dX c jdA dX k dX 2 J c k dk 2 

Therefore At mat can be written as 




From the above, one can see that material dispersion can be specified in units of picoseconds 
per kilometre (length of fibre) per nanometre (spectral width of the source). One usually 
introduces 


D mat = 


4 


1 dx mat 
L dX 


which is known as material dispersion coefficient. In practical emits [14] 



(5.76) 


(5.77) 


where A is in nm and velocity of light c = 3 x 10 5 km s _ 1 . Typical behaviour for Si02—13.5% 
Ge02 is shown in Fig. 5.12. MATLAB code is provided in Appendix 5C.4. 

For waveguide dispersion, we can also determine waveguide dispersion coefficient D wg 
which is defined as 


Here Ax wg is the pulse broadening due to waveguide. It can be expressed as 
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wavelength (pm) 


Material dispersion as a function of optical wavelength. 


Using Eq. (5.73), one can evaluate It is, however, typical to express it in terms of 
parameter V, which is V = V2A = i. Taking derivative 
dV _ A 
~dk ~ 

allows us to express 

d __A d 
dk ~ ~J?dV 

Using the above, we have 

dr wg __A dr wg __VL_ d 1 2 (Vb) 

~dk ~ ~y?~dV~ ~ _ Xc” 2 dV 2 


Finally 


n 2 A d 2 (Vb) 
c-k dV 2 


(5.78) 


Combining the above results for material and waveguide dispersion and using the Sellmeier 
equation with parameters for silica, one can determine total dispersion of fibre glass. The 
work is left as a project. 


5.5 Problems 


1. Based on ray theory, estimate the number of modes M in a multimode fibre when M 
is large. 

2. Determine the cutoff wavelength for single-mode operation of an optical fibre. As¬ 
sume the following parameters: a = 5 pm, n 2 = 1.450, A = 0.002. 

Determine the value of waveguide dispersion for the above parameters. 


3. 
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4. A fibre of 10 km length has an attenuation coefficient of 0.6 dB km -1 at 1.3 pun and 
0.3 dB km -1 at 1.55 (am. Assuming that 10 mW of optical power is launched into the 
fibre at each wavelength, determine the output power at each wavelength. 


5.6 Projects 


1. Write MATLAB code to analyse the combined effects of material and waveguide dis¬ 
persion. 

2. Using our results developed so far, establish methodologies of designing a single-mode 
fibre with predefined properties. 

3. Based on the literature, analyse polarization-maintaining fibres. 

Appendix 5A: Some properties of Bessel functions 


Here, we will provide several properties of Bessel functions needed in the main text. For 
more information on Bessel functions consult books by Arfken [7] and Okoshi [8]. Some 
general formulas are 


J m+X («) + J m ~\ («) = 2 ”j m (u) (5.79) 

K m+i (w) +K m _ l (w) = 2 -K m (w) (5.80) 

u 

2J' m =J m _ x -J m+x (5.81) 

-2K' m = K m _\ + K m+] (5.82) 

also 

J_ m = (-1 ) m J m , K_ m = K m (5.83) 


Start with (5.81) and replace J m+ \ using (5.79). One finds 
J' m (u) _ Jm-\(u) _ m 

UJ„{U ) LlJm(u) U 1 

Similarly, replacing J m _i in Eq. (5.79) one finds 

= m_ J m+l (u) 
nJ m (u ) u 2 

Similar expressions can be derived for functions K m . Using (5.82) and eliminating K„ 
one has 

K’ m (u) _ K m _\(u) m 
wK m ( u ) wK m ( u) w 2 

Finally, eliminating K m _\ in (5.82) one has 

K'Ju) _ , i fi/t m 

wK m (u ) 


(5.86) 


Km+i(u) 

' wK m (u) 


(5.87) 
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Appendix SB: Characteristic determinant 


In this appendix we provide details of the evaluation of characteristic determinant. The 
determinant which describes guided modes has been obtained before and it is given by 
Eq. (5.47): 


J m ( ica ) 0 -K m (ya) 0 

^JU(*«) i^JLOca) ^K m (ya) i°^K' m (ya) 


J m {KO) 

Pm 


~K m ( ya) 


—i - J' m (ko) ^—jJm (xa) —i— 

We will now evaluate it following the method of Cherin [15], Introduce notation 


A = J m (ica), B 
D = K m (ya), E 
The determinant can be written as 


Pm 

K 2 a 

Pm 

ay 2. 


C == (ica) 
F = f K' m (ya) 

r 


A 0 -D 0 
0 A 0 -D 
AB [iC DE iiF 
—SiC AB — e 2 F DE 


Expand the determinant. In the first step we have 



A 

0 

-D 


0 

A 

-D 

A 

nC 

DE 

HF 

-D 

AB 

H.C 

FF 


AB 

-£ 2 F 

DE 


-e\C 

AB 

DE 


and in the second step 


HF\ 

-adY c 

de \ + ad\ ab 

^ +D 2 

AB 

FC I 

DE\ 

\ab 

-s 2 F\ -ejC 

DE\ 

-£lC 

AB\ 


Evaluating the above, one has 

A 2 (D 2 E 2 + iis 2 F 2 ) - AD (-fie 2 CF - ABED ) + AD (ABDE + iie\CF) 

+ D 2 (. A 2 B 2 + neiC 2 ) = 0 

Rearranging terms 

lie 2 ( A 2 F 2 + A DCF) + /j, Sl CD (AF + CD) + 2A 2 D 2 BE + A 2 D 2 (E 2 + B 2 ) = 0 
and finally 


+ 1 ) +m£i i)+( £ + s > 2 =° 
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| Table 5.2 List of MATLAB functions for Chapter . j 

Listing 

Function name 

Description 

5C.1.1 

besselJ.m. 

Plots Bessel function J m 

5C.1.2 

besselJ.m 

Plots Bessel function Y m 

5C.1.3 

bessel K.m 

Plots Bessel function K m 

5C.1.4 

besselJ.m 

Plots Bessel function I m 

5C.2.1 

HEU.m 

Driver function for universal relations 

5C.2.2 

funeJIEW.m 

Function used by driver H E 11 

5C.3 

sellmeier.m 

Plots refractive index using Sellmeier equation 

5C.4 

dispjnat.m 

Determines and plots material dispersion 


Substitute original definitions and extracting common factors 

T J' m {Kd) K' m {ya) 1 I" kln\j' m (kq) k%n 2 2 K' m (ya) 1 _ /3 2 m 2 /J_ _J_\ 

\_KJ m {Ka) KJm(Ka) yK m (K«) J a 2 \x 2 y 2 ) 

(5.88) 


Appendix SC: MATLAB listings 


In Table 5.2 we provide a list of MATLAB files created for Chapter 5 and a short description 
of each function. 

Listing 5C.1.1 Program besseLJ.m. Plots Bessel function J m . 

°i File name: bessel_J.m 

7. Plot of Bessel functions of the first kind J_m(z) 
clear all 

N_max = 101; 7« number of points for plot 

z = linspace(0,10,N_max) ; 7, creation of z arguments 

7. 

hold on 

for m = [0 1 2] 7« m - order of Bessel function 

J = BESSELJ(m,z); 
h = plot(z,J); 

I Redefine figure properties 

xlabel(’z’,’FontSize’,22); 

ylabel(’J_m(z)’,’FontSize’,22); 

text(1.1, 0.85, ’m = 0’,’Fontsize’,18); 

text(l.8, 0.65, ’m = 1’,’Fontsize’,18); 

text(3, 0.55, ’m = 2’,’Fontsize’,18); 
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grid on 
box on 
% 

set(h,’LineWidth’,1.5); 
set(gca,’FontSize’,22); 


7. new thickness of plotting lines 
7. new size of tick marks on both axes 


end 
pause 
close all 

Listing 5C.1.2 Program besseLY.m. Plots Bessel function Y m . 

/ File name: bessel_Y.m 

7. Plot of Bessel functions of the second kind Y_m(z) 
cleax all 

N_max = 101; 7« number of points for plot 

z = linspace(0.01,10,N_max) ; 7o creation of z arguments 

% 

hold on 

for m = [0 1 2] 7o m - order of Bessel function 

Y = BESSELY(m.z); 
h = plot(z,Y); 

7. Redefine figure properties 
xlabelC’z’,’FontSize’,22); 
ylabelf’Y_m(z)’,’FontSize’,22); 
text(2.0, 0.7, ’m = O’,’Fontsize’,18); 
text(3.5, 0.7, ’m = 1’,’Fontsize’,18); 
text(5.0, 0.7, ’m = 2’,’Fontsize’,18); 
axis([0 10 -5 1.5]); 
grid on 

% 

set(h,’LineWidth’,1.5); 7« new thickness of plotting lines 

set(gca,’FontSize’,22) ; 7« new size of tick marks on both axes 

close all 

Listing 5C.1.3 Program besseLK.m. Plots Bessel function K m . 

7. File name: bessel_K.m 

7. Plot of modified Bessel functions of the second kind K_m(z) 
clear all 

N_max = 101; 7o number of points for plot 

z = linspace(0.01,3,N_max) ; 7« creation of z arguments 
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7. 

hold on 

for m = [0 1 2] 7« m - order of Bessel function 

K = BESSELK(m,z); 
h = plot(z,K); 

7. Redefine figure properties 
xlabel(’z’ , ’FontSize’ ,22) ; 
ylabel(’K_m(z)’,’FontSize’,22); 
text(0.2, 0.8, ’m = 0’,’FontSize’,18); 
text(1.0, 0.8, ’m = 1’,’FontSize’,18); 
text(1.5, 0.8, ’m = 2’,’FontSize’,18); 
axis ( [0 3 0 5] ) ; 
grid on 
box on 


set(h,’LineWidth’,1.5) ; °l new thickness of plotting lines 

set(gca,’FontSize’,22); % new size of tick marks on both axes 

end 

close all 


Listing 5C.1.4 Program besseLI.m. Plots Bessel function I m . 

% File name: bessel_I.m 

7. Plot of modified Bessel functions of the first kind I_m(z) 
clear all 

N_max = 101; 7o number of points for plot 

z = linspace(0,3,N_max) ; °/„ creation of z arguments 

7. 

hold on 

for m = [0 1 2] 7« m - order of Bessel function 

Y = BESSELI(m,z); 
h = plot(z,Y); 

7. Redefine figure properties 

xlabel(’z’,’FontSize’,22); 

ylabel(’I_m(z)’,’FontSize’,22); 

text(1.1, 1.7, ’m = 0’,’Fontsize’,18); 

text(1.75, 1.7, ’m = 1’,’Fontsize’,18); 

text(2.4, 1.7, ’m = 2’,’Fontsize’,18); 

grid on 

box on 

7. 

set(h,’LineWidth’,1.5); 7« new thickness of plotting lines 

set(gca,’FontSize’,22); 7« new size of tick marks on both axes 




134 


Linear optical fibre and signal degradation 


end 
pause 
close all 

Listing 5C.2.1 Program HEll.m. MATLAB code for finding a universal relation. The 
program determines and plots the following relations: b = b(V), 

l File name: HEll.m 

7, Function determines and plots universal relation for mode HE_11 
clear all 
format long 

l It works in the range of V = [0.50 2.4] 

N_max = 190; 
for n = 1:N_max 

VV(n) = 0.50 + n*0.01; 

V = VV(n); 

b_c(n) = f zero (IS (b) func_HEll(b,V),[0.0000001 0.8]); 

7. 

hold on 

hi = plot (VV,b_c); 7. plots b =b(V) 

7. 

tempi = VV.*b_c; 

dyl = diff(tempi)./diff(VV); 

Vnewl = W(l:length(VV)-l) ; 

h2 = plot (Vnewl, dyl) ; °/, plots first derivative 

7. 

temp2 = dyl; 

dy2 = diff(temp2)./diff(Vnewl); 

Vnew2 = Vnewl(1:length(Vnewl)-l); 

h3 = plot(Vnew2,Vnew2.*dy2); % plots second derivative 

7. 

text(2.3, 0.6, ’b’,’Fontsize’, 16); 

text(1.8, 1.2, ’{d(bV)}/{dV}’,’Fontsize’,16); 

text(0.25, 1.3, ’V{d-{2}(bV)}/{dV~{2»’,’Fontsize’,16); 

axis([0 2.5 0 1.5]); 

7. Redefine figure properties 
xlabel(’V’,’FontSize’,22); 
grid on 

set (hi, ’LineWidth’, 1.5) ; '/, new thickness of plotting lines 

set(h2,’LineWidth’,1.5); 
set(h3,’LineWidth’,1.5); 

set(gca,’FontSize’,22) ; 7. new size of tick marks on both axes 
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pause 
close all 


Listing 5C.2.2 Program funcJHEll.m. MATLAB function for finding a universal rela¬ 
tion. 

function result = func_HEll(b,V) 

7. Function name: func_HEll.m 
7. Function which defines basic relation 
u= V*sqrt(l-b); 

J1 = BESSELJ(l,u); 

JO = BESSELJ(0,u); 

7. 

w = V*sqrt(b); 

K1 = BESSELK(1,w); 

KO = BESSELK(0,w); 

lhs = J1./J0; 

rhs = (w./u).*(K1./K0); 

result = lhs - rhs; 


Listing 5C.3 Program sellmeier.m. Plots refractive index for S 1 O 2 using the Sellmeier 
equation. 

7. File name: sellmeier.m 

7. Plot of refractive index based on Sellmeier equation 
1 for Si 0_2 
clear all 

N_max = 101; 7. number of points for plot 

lambda = linspace(0.5,1.8,N_max); 7. creation of lambda arguments 

7. between 0.5 and 1.8 microns 
7. 

7. Data for Si0_2 

G_1 = 0.696749; G_2 = 0.408218; G_3 = 0.890815; 

lambda_l=0.0690606; lambda_2=0.115662; lambda_3=9.900559; 7. in microns 
7. 

terml = (G_l*lambda.~2)./(lambda.~2 - lambda_l~2); 
term2 = (G_2*lambda.~2)./(lambda.~2 - lambda_2~2); 
term3 = (G_3*lambda.~2)./(lambda.~2 - lambda_3~2); 
ref_index_sq = 1.0 + terml + term2 + term3; 
ref_index = sqrt(ref_index_sq); 

7. 

h = plot(lambda,ref_index,’LineWidth’,1.5); 

7. Redefine figure properties 
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xlabel(’wavelength (\mum)’,’FontSize’,14); 
ylabelC’refractive index’,’FontSize’,14); 

set(gca,’FontSize’,14); 7. size of tick marks on both axes 

pause 

close all 


Listing 5C.4 Program disp mat.m. Determines coefficient D mat material dispersion. 
Refractive index is determined using the Sellmeier equation. 

7, File name: disp_mat.m 
7. Plots D_mat 

7. Material dispersion is determined using Sellmeier equation 
7. describing refractive index for pure and doped silica 
7. Data from Kasap2001, p.45 for Si0_2 - Ge0_2 
cleax all 

c_light = 3d5; 7« velocity of light, km/s 

disp_min = -20; disp_max =20; 

N_max = 101; 7« number of points for plot 

lambda_min = 1.200; lambda_max = 1.600; 7« in microns 

7. creation of lambda arguments between 1.2 and 1.6 microns 
lambda = linspace(lambda_min,lambda_max,N_max); 

G_1 = 0.711040; G_2 = 0.451885; G_3 = 0.704048; 

lambda_l = 0.0642700; lambda_2 = 0.129408; lambda_3 = 9.425478; 

7. 

terml = (G_l*lambda.~2)./(lambda.~2 - lambda_l~2); 
term2 = (G_2*lambda. ~2) ./(lambda. ~2 - lambda_2“2) ; 
term3 = (G_3*lambda. ~2) ./(lambda. ~2 - lambda_3~2); 


7. 

ref_index_sq = 1.0 + terml + term2 + term3; 
ref_index = sqrt(ref_index_sq); 

7. 

7. -Determination of material dispersion D_mat 


tttl = ref_index; 

dyl_lam = diff (tttl) ./diff (lambda) ; 7o first derivative 
lambdal = lambda(l:length(lambda)-1); 
ttt2 = dyl_lam; 

dy2_lam = diff(ttt2)./diff(lambdal); 
lambda2 = lambdal(l:length(lambdal)-l); 

D_mat = (-Iambda2.*dy2_lam/c_light)*ld9; 

plot(lambda2,D_mat,’LineWidth’,1.5); 

xlabel(’wavelength (\mum)’,’FontSize’,14); 

ylabel(’Material dispersion (ps/km nm)’,’FontSize’,14); 

set(gca,’FontSize’,14); % size of tick marks on both axes 

axis([lambda_min, lambda_max, disp_min, disp_max]) 
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grid on 
pause 
close all 
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Propagation of linear pulses 



In this chapter we outline basic issues associated with the description of linear pulses. 
Basic principles of two fundamental modulation formats, namely return-to-zero (RZ) and 
non-retum-to-zero (NRZ) modulations, will also be explained. 

Formats of data transmission are one of the critical factors in optical communication 
systems. We therefore have implemented several waveforms corresponding to basic mod¬ 
ulation formats and various profiles of individual bits. Later on, some of this information 
will be used to analyse performance of communication systems. We start with a summary 
of the basic pulses. 


6.1 Basic pulses 


In this section we will review properties of the basic pulses. Basic approximations and 
properties will be described. We start with rectangular pulses. 

6.1.1 Rectangular pulses 


The simplest pulse, namely the rectangular one, is shown in Fig. 6.1 . It is described as 

-Ho? otherwise"} 

where T is the pulse duration and .s' 0 its magnitude. It can be represented as a Fourier series 
Srect (0 = f>sin(^ t) (6.2) 


where 



n even 
n odd 


(6.3) 


In Fig. 6.1 we plotted three possible approximations to a rectangular pulse for n — 1, 
n = 3, n = 5 corresponding to different number of terms in Eq. (6.2). One can observe that 
with the increasing number of terms in Fourier expansion more accurate approximation 
to a rectangular pulse is possible. MATLAB code used to create Fig. 6.1 is shown in the 
Appendix, Listing 6A. 1. 
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In practice, one often considers a symmetrical rectangular pulse shown in Fig. 6.3. It is 
defined as 

*■—«-{!: 

Its Fourier transform can be evaluated analytically as follows: 


m»£ 


- r*~*- ii* 

J-T -10) 


*") = - 


(6.5) 


where we have used Euler identity. A plot of Eq. (6.5) is shown in Fig. 6.2. 

To sum up the discussion of rectangular pulses, we have evaluated Fourier transform of 
a rectangular pulse using MATLAB built-in functions. A plot of a symmetric rectangular 
pulse and its MATLAB generated absolute value of Fourier transform are shown in Fig. 6.3. 
Plots were created by MATLAB code shown in Appendix, Listing 6A.2. 


6.1.2 Gaussian pulses 


A Gaussian pulse is considered an accurate model of the data waveforms generated in 
practical optical communication systems. The Gaussian profile is described as 

s G (t) = A e~'^ (6.6) 

o+Jln 

The pulse has an area A = PqZq, where r = ayjlrc and Pq is its maximum value (at t = 0). 
Its spectral function Sq(co) is obtained by Fourier transform and it is also a Gaussian 











(a.u.) 
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Rectangular pulse (left) and the absolute value of its Fourier transform determined using MATLAB functions. 


function, see Fig. 6.4: 


S G (co)=Ae 2 ( “ <T)2 


(6.7) 


6.1.3 Super-Gaussian pulse 


A super-Gaussian pulse is a generalization of the usual Gaussian and is 

ssdt) = — 

o\/2n 


( 6 . 8 ) 
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Super-Gaussian pulses for several values of m = 1,2, 3, 4. 


The parameter m controls the degree of edge sharpness. For m = 1 one recovers the ordinary 
Gaussian pulse. For larger values of m the pulse becomes closer to a rectangular pulse, see 
Fig. 6.5. (Figure generated by MATLAB code, in Appendix, Listing 6A.3.) 


6.1.4 Chirped Gaussian pulse 


Chirp refers to a process which changes the frequency of a pulse with time. Chirp pulses are 
most commonly created during direct modulation of a semiconductor laser diode (semicon¬ 
ductor lasers are discussed in Chapter 7). Mathematical expression for a chirped Gaussian 
pulse atz = 0 is [1], [2] 


s(z = 0, t) = Ret exp 


= e-iG 


*) cos^+ic^) 
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Gaussian pulse 



where To determines the width of the pulse, C is the chirp factor which determines the 
degree of chirp of the pulse and coo is the carrier frequency of the pulse. s(0, t) is the 
dimensionless function describing pulse. 

Phase of such pulses is 




( 6 . 10 ) 


Change in frequency Sco(t) is therefore 


8co(t) = — 


( 6 . 11 ) 


Fourier spectrum of a chirped pulse is broader compared to unchirped one. Evaluation of 
the Fourier transform of (6.9) gives 

*->-(sif ■*[-*&] 

The spectral half-width is defined as 


\m = 

Ho, 0)f, 2; 


(6.13) 


For a symmetric pulse centered at t = 0, one has A co = w e . Detailed calculations for 
spectral half-width (point where the intensity drops 1 /e) give [3] (see Problem 2) 


Acu = (1 + C 2 ) T~ l 


(6.14) 


Chirped Gaussian pulse is plotted in Fig. 6.6 (lower plot). For comparison, regular 
Gaussian pulse for the same parameters is also shown (upper plot). The plots were obtained 
using MATLAB code provided in Appendix, Listing 6A.4. 
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6.2 Modulation of a semiconductor laser 


In practical applications optical pulses are created by semiconductor laser. Typical light- 
current (L-I) characteristics of the semiconductor laser are shown in Fig. 6.7. We show here 
an output of power-current characteristics (L-I) for several temperatures. One can observe 
a flat region to the left until the so-called threshold and then sudden increase in power. 
Threshold current I th is the forward injection current at which optical gain in the laser 
cavity is equal to losses. Above the threshold, stimulated emission is the dominant process 
in converting current to light; below the threshold is the light emission which is caused 
by spontaneous processes with characteristics (speed, spectrum and efficiency) similar to 
those from light emitting diodes (LEDs). 

In Fig. 6.8 we illustrate principles of direct modulation of a semiconductor laser. It is a 
simple and inexpensive process since no other components are required. The disadvantage 
of direct modulation is that the resulting pulses are considerably chirped. 

6.2.1 Modulation formats 


Various modulation formats are often discussed in literature. For detailed analysis consult 
books by Keiser [4], Liu [5], Binh [6], Ramaswami or Sivarajan [7], Here, we discuss briefly 
two important cases only as illustrated in Figs. 6.9 and 6.10. 

At a digital format light pulse intensity (say) larger than zero, represents logical ONE and 
its absence, logical ZERO. In the retum-to-zero (RZ) format, signal drops to zero between 
the pulses, see Fig. 6.10. Each bit has allocated time T. In RZ, pulses occupy half of the 
time slot reserved for each bit, see Fig. 6.10. Also, power spectrum is shown. Since most of 
the signal power is at the lower frequency, the RZ signal is transmitted by a system having 
a bandwidth of j Hz. 

In the non-retum-to-zero (NRZ) format, the pulses occupy entire time slot reserved for 
each bit. NRZ pulses are twice as long as the RZ pulses. In NRZ modulation format the 
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NRZ 


0 



Comparison of RZ and NRZ modulation formats. Digital bit stream is: 01010. 


intensity of light remains at its high value whenever a number of consecutive ONEs are 
transmitted. The bandwidth is 1/ T, only half of that required for RZ system. 

For encoding, practical current pulse driving the semiconductor laser is assumed to be 
of the form [8] 


7(f) = 


0 

I m [1 - exp {-t/t r )] 
J m exp (—(f — T)/t r ) 


t < 0 


0 <t<T' 
T' < t 


(6.15) 


where I m is the peak modulation, t r determines the pulse rise time, and T' = T for NRZ 
encoding and T' = T /2 for RZ coding. The bias current I bias is typically 1.1 times the laser 
threshold current. 
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1/T 2/T f 


1/T 2/T f 


Comparison of modulation formats (RZ and NRZ) (schematically) and their corresponding bandwidths. 

Individual pulses 



Train of Gaussian pulses for logical combination of bits 010110. Individual pulses are shown (top) and also complete 
signal (bottom). 


6.2.2 Creation of waveforms 


In practice, to evaluate photonic systems for some modulation formats one needs to generate 
waveforms consisting of pulses of various shapes. 

The generated sequence of Gaussian pulses (waveform) for some particular logical com¬ 
bination is illustrated in Fig. 6.11. MATLAB code is provided in Appendix, Listings 6A.5 
and 6A.5.1. 
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time [s] 

Eight-bit pattern generated in NRZ format. 


Another wavefront which is used to represent driving current of semiconductor lasers in 
NRZ (non-return-to-zero) format can be expressed as [9] 


J mod 


(0 = 


' hia S +4 0 
hia S +I m e- 2 ^ 

Ibias 

Jbias+L 


current bit 1, previous 0 
current bit 0, previous 1 
current bit 0, previous 0 
current bit 1, previous 1 


(6.16) 


Implementation of the above modulation format is shown in Appendix, Listing 6A.6 and 
typical result in Fig. 6.12. 


6.3 Simple derivation of the pulse propagation equation in the 
presence of dispersion 


Assume the existence of a propagating electric field E(r, t ) at point r and time t. It can be 
Fourier decomposed as 

E(r, t)=^- f E(r, co) e~ ia>t dco (6.17) 

2n J_ OQ 

Therefore electric field can be interpreted as an infinite sum of monochromatic waves, each 
having an angular frequency co. The inverse Fourier transform is 


E(r, co) = 


/: 


E(r, t) e J0,t dt 


(6.18) 
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We further assume that Fourier component E(r, w) can be expressed as 

E(r, oo) = xB( 0, oo) F(x,y ) e> pz (6.19) 

where unit vector x represents the state of polarization of propagating field, 5(0, w) is the 
initial amplitude of the field propagating along z-direction, Fix, y) describes the transverse 
distribution of the field and ft is the propagation constant. We made a similar assumption 
earlier when we discussed propagation of the EM field in planar waveguides. 

For the field propagating in an optical fibre, the transverse distribution F(x,y ) is often 
assumed to be Gaussian, namely 

F(x,y) = exp (6.20) 

where A e jj is the effective area of the fibre. 

The effect of dispersion results in the frequency dependence of the propagation constant 
ft; i.e. ft = ft (&>)• As a result, each frequency component in Eq. (6.17) will propagate with 
a different propagation constant. 

To derive propagation equation for each frequency component, write Eq. (6.19) as 

E(r, oo) = E (x,y, z = 0, oo) e ipz (6.21) 


From above, by direct differentiation with respect to z, one finds 
3E(r, co ) 

—^—~ = j7K<w)E(r, co) (6.22) 

The above equation describes propagation of EM field with the dispersion induced, broad¬ 
ening included. More rigorous derivation of the above equation, starting from Maxwell’s 
equations, will be performed in the next section. 

Propagation constant ft(co) in terms of the frequency dependent mode index niw) is [3] 


ft(fi>) = n(co) — 


(6.23) 


It is customary to expand ft (&>) around the centre frequency of the pulse wo in a Taylor 
series 

ft(co) = ft 0 + fti (w — coo ) + -ft 2 {co - (o 0 ) 2 + -ft3 (<w - &>o) 3 (6.24) 


where ft, = . Expansion up to the third order is sufficient to describe typical effects 

existing in optical fibre. More practical expressions for expansion coefficients are [3] 


nM + Jjp.] = "jL = ± 

aw J c v g 


(6.25) 


where n g is the group index and v g group velocity. For coefficient ft 2 one finds 

J 2 ft(w) = 3/3, = 1 r dn(w) d 2 nia>) 1 
2 dw 2 dw c [ dw W dw 2 \ 


(6.26) 
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Coefficient /L is related to dispersion parameter D as 

1 dr 2ire „ 


(6.27) 


Here r is the group delay through a fibre of length L, and A is the wavelength. Units of 
parameter D are 

To account for parameter /J 3 in practice, the so-called dispersion slope S is introduced: 


S 


dD 

~dX 


(6.28) 


Parameter S describe the variation of D with the wavelength. Units of it are [ „j2. km \ ■ 


6.4 Mathematical theory of linear pulses 


Here, we will outline a full derivation of the equation describing linear pulses based on 
Maxwell equations. This more formal derivation illustrates needed approximations. 

Maxwell’s equations were introduced in Chapter 3; for optical fibre where there is no 
flow of current and no free charges these are 


and 


V x E 


<)B 
dt ’ 


(6.29) 


V • D = 0, V • B = 0 


(6.30) 


Constitutive relations are expressed 


D = £ 0 ® ”1“ P, O — /xq H -(- M 


(6.31) 


Here P is polarization which is divided into linear and nonlinear parts: 


P(r,t) = Pi(r,0+Pvz(r,0 


(6.32) 


For a nonmagnetic medium such as fibre, the magnetization M = 0. In this chapter we 
discuss only linear effects. 

Linear polarization is related to electric field as 


Pi(r, t) = so J x (1) (f — O E(r, t') dt' 

Fourier transform and its inverse are introduced as 

E(r, (o) = J E(r, t) e 1( °‘dt 

E(r, t) = f E(r, co) e mt dco 

2tx J -oo 


(6.33) 


(6.34) 


(6.35) 
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Fourier transformation of linear polarization, Eq. (6.33) gives 
Pi(r,£B) = j P L (r, 0 e ia> ‘dt 

= So / J x (X \t - t') E(r, t') e mt dt'dt 
= eoj°° f X (1) if") E(r, t') d mt " e i(at 'dt'dt" 

= s 0 J dt'E(r, t') e mt ' j dt"x 0, (t") e uot " 

= e 0 X (1) («)E(r,ft>) (6.36) 


where we have made change of variables t — t' = t". x n) (oj) is the Fourier transform of 
the susceptibility x (1) (0 

X (1) (co) = j°° dtx il) (t) e wt (6.37) 

To derive wave equation take V x ... operation of the first Maxwell’s equation and use the 
second Maxwell’s equation and constitutive relations. In a few steps, one obtains 


9 

V x V x E(r, t) = - —V x B(r, t) 

= -^MoV x H(r, t) 

9 2 D(r, t) 

9 2 E(r, t) 9 2 P(r,0 

- -^0 ^ /xo ^ 


In the linear regime 
V 


Using mathematical identity 


V x V x E = V (V • E) - V 2 E 


and third Maxwell’s equation (V • D = eV • E = 0), one finally obtains 
v.BM + ^ + w %1> = 0 

The above is a linear equation in field E(r, t ). Taking Fourier transform gives 


(6.38) 


V 2 E(r, o>) + ^-E(r, co) + fiocu 2 Pi(r, co) = 0 (6.39) 

c 2 

Finally, substituting the result in (6.36) for linear polarization gives the wave equation in 
the frequency domain 

V 2 E(r, co) + e(&)) C ^r-E(r, co) = 0 (6.40) 

c 2 

The above three-dimensional wave equation will be developed in one dimension next. 
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6.4.1 One-dimensional approach 


A one-dimensional wave equation (in time domain) for a linear medium with zero dispersion 

— - (M1) 

where ho is refractive index which in the absence of dispersion is frequency independent. 
It is related to propagation constant p 0 as 

Po = n 0 k 0 (6.42) 

The wave equation describes pulses with central frequency coq. The following relation holds: 

k 0 = «oV/ 2 o£o (6.43) 

Solution of the wave equation which describes linear pulse is 

E(z, t) — A(z, t) (6.44) 

where A (z, t) is the pulse envelope. In the following we will work within slowly varying 
envelope approximation (SVEA), which is based on the following approximations: 




3 A{z,t)\ 

| d 2 A(z, t) 


r 3- 

1 d 2 A(z, t) 1 


9A(z, t) 1 

3 1 2 




One needs to evaluate derivatives appearing in Eq. (6.41) using solution given by Eq. (6.44) 
and then apply SVEA. The resulting linear wave equation describes pulses propagating 
without dispersion and it is 

3A(z, t) p 0 8A(z, t) _ 


b — „ =0 


(6.45) 


dz coo 3 1 

In the presence of dispersion it is convenient to represent pulse in terms of Fourier compo¬ 
nents. Each such component travels with different velocity. 

Taking Fourier transformation of the original pulse described by Eq. (6.44) gives 


E(z,t) — — [ E(z, co) e mt dco 

J -oo 


(6.46) 


E (z, co) describes propagation of spectral component with angular frequency co inside the 
fibre as follows [1]: 

E(z,co) =E(0,(o)e~ ilSz (6.47) 

Combining Eqs. (6.44), (6.34) and (6.47) gives 

A(z, t) e l(a,ot - paZ) = [ E(0,co) e~ ipz e ia>, dco 

Z7T J-oo 
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From the above, the expression for pulse envelope A (z, t ) is 

A(z,t)=— [ E{ 0, cu) e- iip ~ M: dco (6.48) 

2?r j _ 0 o 

Next, let us introduce Aa> = co — co 0 and assume A co <?C co 0 . 

Pulse broadening results from frequency dependence of the propagation constant P; i.e. 
P = P(co). Because of A co <JC coo, it is customary to expand P around frequency co o as 
follows [the same equation as (6.24)]: 

P(co) = p 0 + p 1 Aco+ \p 2 (Aft)) 2 + \pi (A«) 3 + ... (6.49) 

z 6 

where higher order terms are neglected. The following notation has been introduced: 
Po = /6(ft) 0 ), A = ^| _ > A = 0| _ , Pj, = 01 . Substituting the above 

expansion into Eq. (6.48) gives 

A (z, t) — f E (0, «) e‘ Aorf 

Z7T ,/_ O0 


The above expression will be used to evaluate derivatives AA ^f ] and 3 ^ ,f) . One finds 

a piz e -i(to?ilto e -i(A*?llb* da) ( 6 . 50 ) 


3^(M) 1 fi 00 - , , 

~^ = ^L m ’ co)lAcoe 


^ = T- J°° E(0, &))(—z'Aft)(i|) e ,Aa " e-'^cT'^^e-^^dco 

+ ± f°°J( 0,ft)) [-ii(Aft,) 2 A] ^ e^e-it^n^r^n^dfo 

x e iAa>t (6.51) 


Terms on the right hand side of the above equation can be expressed in terms of the time 
derivatives as follows: 


T; /_°° E^coK-iAcoP!) e iA °>‘ 


i- y°°f(0,ft)) [-/I(Aft,) 2 ft] ^ 

1 9^0 

If^co) [-^(Aft)) 3 ftJ e iAa>t e~ iAa>fS *e 


,-KAa>flhz e -KA'o?y h z d(o 
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Using the above replacements in Eq. (6.51) finally gives 

3A(z, t) BA(z, t) 1 d 2 A(z, 0.1 d 3 A(z, t) 

= '2 fc_ 3^ + 6 A ^^ <6 ' 52) 

This is the basic equation which describes pulse evolution in the linear regime in the 
presence of dispersion. Its generalization to include the nonlinear effect (Kerr effect) which 
describes optical solitons will be discussed in Chapter 15. 

In the absence of dispersion, i.e. when = Pi = 0, the optical pulse propagates without 
changing its shape. In such case pulse envelope changes as A(z, t) = A(0, t - fi\z). 

The general equation describing pulse propagation can be simplified by going to a 
reference frame moving with the pulse. The transformation to the new coordinates is 

t' = t — P\z) and z! = z (6.53) 

In the reference frame moving with the pulse, Eq. (6.52) takes the form 

dA(z / , t') i d 2 A(z / , t’) \ o #A&,t!) , eeA ^ 

+ ~2 h dt a ~ 6 th (6 ' 54) 

The above equation found many applications which have been discussed extensively, see 
[3], [1], [9]. 


6.5 Propagation of pulses 


6.5.1 Analytical description of the propagation of a chirp Gaussian pulse 

Pulse behaviour is described by the Eq. (6.54) derived previously. For /I3 = 0 the equation 
is 

dA +‘« d2A 0 
a? + 2 ft 3^ =0 


and pulse evolution can be determined analytically. The solution is 

A(z,t) = f A(0, co) e^ 20>2z ~ mt d(o 

2 Tt J-oo 


(6.55) 


In the solution, the initial Gaussian pulse A(0,cu) is given by Eq. (6.12). Substituting 
Eq. (6.12) into (6.55) and evaluating the Gaussian integral gives 

T 0 [ (1 +iC)t 2 


A(z, t) = - 


2 ex P i — T 


(6.56) 


[T 2 - ifaz (1 + iC)] 1/2 ’ " l T 2 - ip 2 z (1 + iC) J 
The above result defines new pulse width 7) for a propagating pulse at position z. Its value 


5-[0 




(h* 

v n 


)T 


Evolution of chirped and unchirped pulses is shown in Fig. 6.13. Consult listing 6A.7 
for MATLAB code. 
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3 



0 


0 


0.5 


1.5 


Evolution of pulse width as a function of normalized distancez/ftj for chirped and unchirped pulses. Dispersian 


length is defined by Eg. (15.14). 


6.5.2 Numerical method using Fourier transform 


We finish this chapter by outlining the numerical method of using Eq. (6.54) to describe 
propagation of a Gaussian pulse (and also a super-Gaussian pulse) in the presence of 
dispersion. The method consists of the following steps. 

1. The single pulse p = exp(—t * 1 2 /To ) is used with sampling period of T = 0.08.? in the 
time domain. (If needed it can be plotted using plot(t, p) MATLAB function.) 

2. The Fourier transform of this single pulse is found using P = f ftshi f t ( f f t(p\) code 
where p is in time domain and P is in the frequency domain. 

3. The frequency domain pulse is sampled using sampling frequency = 1/64. The solution 
in frequency domain is found by multiplying Fourier transform of single pulse and transfer 
function H (co) where spectral function is 



(6.58) 


where a are losses, L is the propagation length and ft and ft are previously defined 
dispersion parameters. 

4. The solution in time domain is obtained by using inverse Fourier transform 
if ft(f ftshift ( P_prop)) and it is plotted. Consult Listings 6A.8 and 6A.8.1 for MAT- 
LAB code. 

The propagation of Gaussian pulse over a distance of 4000 km is shown in Fig. 6.14 for 
the following parameters: a = 0, ft = 60ps 2 /km and ft = 0.01 ps 3 4 /km. MATLAB code 
is provided in Appendix, Listings 6A.8.1 and 6A.8.2. 
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Dispersion induced broadening of a Gaussian pulse. = 60 ps 2 /km and /J 3 = 0.01 ps 3 /km. 



^ Evolution of a super-Gaussian pulse with m = 3.j3 2 = Oand /} 3 = 0.01 ps 3 /km. 


In Fig. 6.15 we show propagation of a super-Gaussian pulse with m = 3 over the same 
distance with /f 2 = 0. The remaining parameters remain unchanged. 

6.5.3 Fourier transform split-step method 


This method will be extensively discussed in later Chapter 12 describing beam propagation 
method (BPM). Here we use it to illustrate propagation of Gaussian pulse. First, we briefly 
describe the method. 















Problems 


Table 6.1 Values of parameters. 

Name 

Symbol 

Value 

Second-order dispersion 

h 

60 ps/nm _1 km -1 

Third-order dispersion 

ft 

0.01 ps/nm _1 km -1 

Losses 

a 

0 

Width of Gaussian pulse 

w 0 

20 ps 


We use the linear Schroedinger equation derived previously, Eq. (6.54). Taking Fourier 
transform results in 

3 A(z, co) iB2 , ~ /h , ~ a ~ 

-r-= — (~ico) 1 2 A(z, co) + ^ (~ico) 3 A(z, co) - - A(z, co) (6.59) 

dz 2 6 2 

where a describes losses. Fourier transform is defined in a usual way as 


A(z, co) = J A(z, t) e mt dt 
Eq. (6.59) can be expressed as 


3 A(z, co) 
dz 


= P • A(z, co) 


where the propagator operator P is defined as 

P=f^+^-| ( 6 . 61 ) 

Eq. (6.60) can be integrated over small distance h along the z-axis with the result 


A(z + h,co) = e Ph A(z, co) 


(6.62) 


The above equation is directly implemented in MATLAB. In Table 6.1 we summarized 
values used in simulations. 

The results are shown in Figs. 6.16 and 6.17. We show evolution of the Gaussian pulse 
and also a three-dimensional view of the evolving pulse. MATLAB code is provided in 
Appendix, Listing 6A.9. 


6.6 Problems 


1. Evaluate Fourier series for rectangular pulse represented by Eq. (6.1). Determine 

coefficients s„. 

Evaluate Fourier transform of chirped Gaussian pulse and its spectral half-width A co 
as given by Eq. (6.14). 


2. 
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Evolution of Gaussian pulse by Fourier transform split-step method. 



Three-dimensional view of the evolution of the Gaussian pulse shown in Fig. 6.16. 


Appendix §A: MATLAB listings 


In Table 6.2 we provide a list of MATLAB files created for Chapter 6 and a short description 
of each function. 
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Table 6.2 List of MATLAB functions for Chapter . 

Listing 

Function nam 

e Description 

6A.1 

rp.m 

Creates approximations to rectangular pulse 

6A.2 

FTj-p.m 

Evaluates FT of rectangular pulse using analytical expression and 
also MATLAB functions 

6 A. 3 

super gauss.n 

t Plots super Gaussian pulses 

6A.4 

gaussjchirp.tr, 

i Plots chirp Gaussian pulses 

6A.5.1 

ptrain.m 

Generates sequence of Gaussian pulses 

6A.5.2 

fgauss, m 

Function used by ptrain.m 

6 A. 6 

bit-gen.m 

Illustrates NRZ modulation format 

6A.7 

pulse-evol.m 

Evolution of pulses (chirped and unchirped) 

6A.8.1 

pdisp.m 

Width of Gaussian pulse with dispersion 

6A.8.2 

gauss jn.m 

Function used by pdisp.m. Defines Gaussian pulses 

6A.9 

sslg.m 

Evolution of Gaussian pulse by Fourier transform split-step method 


Listing 6A.1 Function rp.m. Function which draws rectangular pulse and calculates 
approximations based on Fourier series for three values of n. 


I File name: rp.m (rectangular pulse) 
clear all 
N_max = 100; 

7 . Pulse characteristics 
s_0 = 1.0; 
t_min = 2; 
t_max = 10.0; 

A = 0.5; 
dt = 1/N_max; 
t = 0:dt:2*t_max; 

x = A*(sign(t-t_min)-sign(t-t_max)); 

hold on 

h = plot(t,x) ; 

set(h,’LineWidth’,2); 

axis([0 1.2*t_max -0.5 1.7]); 

% 

s = 0.0; 

for n = [13 5] 

s = s + 4*s_0/(pi*n)*sin(n*pi*(t-t_ 
h = plot(t,s) ; 
set(h,’LineWidth’,2); 

end 

axis ( [0 1.2*t_max -0.5 1.7]); 
xlabelC’time (a.u.)’,’FontSize’,14); 
ylabel(’s (a.u)’,’FontSize’,14); 
set(gca,’FontSize’,14); 


7, number of points for plot 

7, pulse amplitude 
7. start of pulse 
% end of pulse 


l creates rectangular pulse 


l plots rectangular pulse 


_min)./(t_max-t_min)); 


7. size of x label 
7, size of y label 
7. size of tick marks on both axes 
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°i Add description 
text(5.5, 1.4, ’n 
text(7.0, 1.4, ’n 
text(8.5, 1.4, ’n 

close all 


= 1’,’Fontsize’,13) 
= 3’,’Fontsize’,13) 
= 5’,’Fontsize’,13) 


Listing 6A.2 Function FT_rp.m. Function plots rectangular pulse, its Fourier transform 
obtained analytically, and also evaluates its Fourier transform using MATLAB built-in 
functions. 

"A File name: FT_rp.m 

’/, Plots symmetric rectangular pulse of duration tau and its 
’/, Fourier transform 
clear all 

N_max = 1000; "/, number of points for plot 

’/, Pulse characteristics 

tau = 10.0; "/. 1/2 pulse width 

A = 0.5; "/, pulse height 

"/. Plotting Fourier transform of rectangular pulse using analytical formula 
omega = linspace(-20/tau, 20/tau, N_max); "/, creation of frequency argument 

S = 2*A*sin(tau.Iomega)./omega; 
h = plot(omega,S); 

set(h,’LineWidth’,1.5); "/, thickness of plotting lines 

grid 

xlabelC’frequency (a.u.)’,’FontSize’,14); 

ylabelC’Fourier transform (S) (a.u.)’,’FontSize’,14); 

set (gca,’FontSize’,14); ’/, size of tick marks on both axes 

text(0.65, 0, ’A’,’Fontsize’,13); 

text(l, 0, ’B’,’Fontsize’,13); 

pause 

close all 

-Plotting rectangular pulse- 

dt = 1/N_max; 

t = -6*tau:dt:6*tau; "/. Creation of points for pulse plotting 

s = A*(sign(t+tau)-sign(t-tau)); 
h = plot(t,s); 

set (h, ’LineWidth’ ,1.5); "/, thickness of plotting lines 

axis([-2*tau 2*tau 0 2]); 

xlabel(’time (a.u.)’,’FontSize’,14); ylabel(’s_r (a.u.)’,’FontSize’,14); 
set (gca,’FontSize’, 14); "/, size of tick marks on both axes 

pause 
close all 

"A-Plotting its Fourier transform using Matlab functions- 

y = fft(s); "/, discrete Fourier transform 

SM = fftshift(y); "/. shift zero-frequency component to center of spectrum 
N = length(SM); 
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k = -(N-D/2: (N-l)/2; 
f = abs(SM); 
h = plot(k,f); 

set(h,’LineWidth’,1.5); 7, thickness of plotting lines 

axis([-4*tau 4*tau -5000 30000]); 

ylabeK’Abs FT of rectangular pulse (a.u) ’, ’FontSize’ , 14); 
grid 

set(gca,’FontSize’, 14) ; 7, size of tick marks on both axes 

close all 


Listing 6A.3 Function super gauss.m. Plots super-Gaussian pulses for several values of 
parameter m. 

I File name: super_gauss.m 
% Plots super-Gaussian pulses 

A = 1; 
sigma = 30; 

N = 300.0; 

t = linspace(-50,50,N); 
hold on 

for m = [1 2 3 4] 

p = A/(sigma*sqrt(2*pi))*exp(-t.~(2*m)/(sigma~(2*m))); 
h = plot(p); 

set(h,’LineWidth’,1.5); '/, thickness of plotting lines 

end 

xlabelC’time’,’FontSize’,14); ylabel(’amplitude’,’FontSize’,14); 
set(gca,’FontSize’,14); % size of tick marks on both axes 

close all 


Listing 6A.4 Function gauss_chirp.m. Generates Gaussian and chirp Gaussian pulses. 
”/, File name: gauss_chirp.m 

% Creation of Gaussian pulse and also chirped Gaussian pulse 
clear all; 

7. 

t_zero = 20.0; 7, center of incident pulse 

width = 6.0; 7, width of the incident pulse 

C = 15; % chirped parameter 

% 

N = 300.0; 
time = linspace(0,50,N); 
pulse = exp(-0.5*((t_zero 


- time)/width).~2); 


7. Gaussian pul: 
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pulse_ch = exp(-(0.5+li*C)*((t_zero - time)/width).~2); 7. chirped pulse 

7. 

subplot (2,1,1), h = plot(time,pulse); 

set(h,’LineWidth’,1.5) ; 7. thickness of plotting lines 

title(’Gaussian pulse’,’FontSize’,14) 

xlabel(’time’,’FontSize’,14); ylabel(’amplitude’,’FontSize’,14); 
set(gca,’FontSize’,14); 7. size of tick marks on both axes 

7. 

subplot (2,1,2), h = plot(time,pulse_ch); 

set(h,’LineWidth’,1.5) ; 7. thickness of plotting lines 

title(’Chirped Gaussian pulse’,’FontSize’,14) 

xlabel(’time’,’FontSize’,14); ylabel(’amplitude’,’FontSize’,14); 
set(gca,’FontSize’, 14) ; 7. size of tick marks on both axes 

pause 
close all 


Listing 6A.5.1 Function ptrain.m. Generates a train of pulses. 


7. File name: ptrain.m 
7, Generates train of pulses 
7. Allows for overlap of pulses 
clear all 

bits = [010110]; 

T = ld-9; 

num_pulses = length(bits); 

N = 1000; 
width = 0.3*T; 

time=linspace(0,num_pulses*T,N); 

signal = zeros(l.N); 

pulses = zeros(num_pulses,N); 

7. 


"/, definition of logical pattern 
"/, pulse period [s] 

% number of pulses 
"/, number of time points 
7. width of pulse 


■ignal 


t_0= T/2; '/.position of peak of first impulse in 

for i=l:num_pulses 
if (bits(i)==l) 


pulses(i,:)= fgauss(time,width,t_0); 
signal = signal + fgauss(time,width,t_0); 


t_0 = t_0 + T; 




7. 


subplot (2,1,1); h = plot(time.pulses); 
set(h,’LineWidth’,1.5) ; 7, 

xlabel (’ time ’ , ’ FontSize ’ , 14) ; 7. 

title(’Individual pulses’,’FontSize’,14) 
set(gca,’FontSize’, 14); 7. 

subplot (2,1,2); h = plot(time,signal); 
set (h,’LineWidth’,1.5); 7. 


thickness of plotting lines 
size of x label 

size of tick marks on both sixes 
thickness of plotting lines 
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xlabel(’time’,’FontSize’,14); 7. size of x label 

set(gca,’FontSize’,14); 7. size of tick marks on both axes 

title(’Total signal’,’FontSize’,14) 

pause 

close all 

Listing 6A.5.2 Function fgauss.m. Function used by program ptrain.m. Function creates 
individual Gaussian pulses. 

function pulse = fgauss(time,width,t_0) 

7, function pulse = wave_gauss (t ,T_period) 

7. Generates individual Gaussian pulse used in the creation of 
l train of pulses 

pulse = exp(-0.5*((t_0 - time)/width).~2); 

Listing 6A.6 Function bits_gen.m. Function generates an 8-bits long bit pattern. 

7. File name: bits_gen.m 

7.- 

i. Purpose: 

7. Generates 8-bits long pattern 

7. generates single bit which is then repeated 8 times 
7. Source 

7. R.Sabella and P.Lugli 

7. "High Speed Optical Communications" 

7. Kluwer Academic Publishers 1999 
7. p.32 


clear all 

bits =[01001101]; 7. Definition of logical pattern 

7. 

T_period = ld-9; 7. pulse period [s] 

I_bias = 2; 7. mA 

I_m = 3; 7. mA 

tau_r = 0.2*T_period; 7, rise time 

7. 

7. Generation of current pattern corresponding to bit pattern 
I_P = 0; 

N_div = 50; 7, number of divisions within each bit interval 

t = linspace(0, T_period, N_div) ; 7. the same time interval is 

7. generated for each bit 

%******** first bit ************************************* 
if bits(l)==0, I-P-l = I_bias + 0*t; 

elseif bits(l)==l, I_p_l = I_bias + I_m + 0*t; 

7- 
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°l Generates single, arbitrary bit 
°/.t = linspace(0, T_period, N_div) ; 
temp_I = I_p_l; 

I 

number_of_bits = length(bits); 

% 

for k = 2:number_of_bits 

if bits(k)==l && bits(k-l)==0 

I_p = I_bias + I_m*(l - exp(-2.2*t./tau_r)); 
elseif bits(k)==0 && bits(k-l)==l 

I_p = I_bias + I_m*exp(-2.2*t./tau_r); 
elseif bits(k)==0 && bits(k-l)==0 

I_p = I_bias + 0*t; 
else bits(k)==l && bits(k-l)==l 

I_p = I_bias + I_m + 0*t; 

end 

I_p = [temp_I,I_p]; 
temp_I = I_p; 

temp_t = t; 

for k = 2:number_of_bits 

t = linspaceCO, T_period, N_div); 
t = [temp_t,(k-l)*T_period+t]; 
temp_t = t; 

% 

x_max = max(t); 
y_min = I_bias; 
y_max = I_bias + 1.2*I_m; 

% 

h = plot(t,I_p); 
grid on 

set(h, ’LineWidth' ,1.5) ; 7 thickness of plotting lines 

xlabelC’time [s]’,’FontSize’,14); 7 size of x label 

ylabel(’Current [mA]’,’FontSize’,14); % size of y label 

set(gca,’FontSize’,14); % size of tick marks on both axes 

axis([x_min x_max y_min y_max]) 

pause 

close all 

Listing 6A.7 Function pulse evol.m. Function describes evolution width of a chirped 
pulse. 

'/, File name: pulse_evol.m 

7, Describes evolution of pulse width as a function of normalized 
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7, distance for chirped and unchirped pulses 


clear all 
7 0 Data 

beta_2 = -20; 

T_0 = 200; 

L_D = (T_0~2)/abs(beta_2); 
N_max =401; 

x = linspace(0,1.5,N_max); 


7. [-20ps~2/km] at 1.55 microns 
7. [ps] = 0.2 ns, bit rate lOGb/s 

7, number of points for plot 
7. normalized distance; x = z/L_D 


hold on 

for C = [-1 0 12] 7o chirped coefficient 

T = sqrt((l+sign(beta_2)*C*x).~2 + (x).~2); 
h = plot(x,T); 

set(h,’LineWidth’,1.5); 7. thickness of plotting lines 

end 

7. Redefine figure properties 
ylabel(’T_1/T_0’,’FontSize’,14) 
xlabel(’z/L_D’,’FontSize’,14) 
text(0.35, 1.65, ’C = -1’,’Fontsize’,14) 
text(0.35, 1.2, ’C = O’,’Fontsize’,14) 
text(0.35, 0.8, ’C = 1’,’Fontsize’,14) 
text(0.35, 0.55, ’C = 2’,’Fontsize’,14) 

set(gca,’FontSize’,14); 7. size of tick marks on both axes 

pause 

close all 


Listing 6A.8.1 Function pdisp.m. Function describes the evolution of a Gaussian pulse 
with dispersion. 


7. File name: pdisp.m 

7. Propagation of Gaussian pulse in the presence of dispersion 
7. using Fourier transform 
clear all; 

/ Creation of input pulse 


T_0 = 1.0; 

T_s = 0.08; 
t = -4:T_s:4-T_s; 
p = gauss_m(t,T_0); 

P=fftshift(fft(p)); 
Fs=l/64; 

N=length(t); 

f = -N/2*Fs:Fs:N/2*Fs-Fs; 


7. pulse width 

7 , sampling period 

7. creation of time interval 

7. Fourier transform of the original pulse 
7. sampling frequency 
7. length of time interval 
7. frequency range 


omega = 2*pi*f; 

7, Parameters of optical fiber 



164 


Propagation of linear pulses 


alpha = 0.0; 7. losses 

beta_2 = 60.0; 7. coefficient beta_2 [ps"2/km] 

7.beta_2 = 0.0; 7, coefficient beta_2 [ps~2/km] 

beta_3 = 0.01; ’/, coefficient beta_3 [ps~3/km] 

7. Transfer function of optical fiber 
distance = 4000; 

H=exp((alpha/2+li/2*beta_2*omega.~2+li/6*beta_3*omega.~3)*distance); 

7. 


P_prop = P.*H; 7. Fourier transform of final pulse 

p_prop = ifft(fftshift(P_prop)) ; 7o time dependence of final pulse 
p_plot = abs(p_prop).“2; 

7. 

h = plot(t, p, t, p_plot); 

set(h,’LineWidth’,1.5); l thickness of plotting lines 

xlabel(’time’ , ’FontSize’ ,14); 7. size of x label 

ylabel(’Arbitrary units’,’FontSize’,14); 7. size of y label 

set(gca,’FontSize’,14); 7. size of tick marks on both axes 

legend(’original pulse’, ’transmited pulse’) 

close all 


Listing 6A.8.2 Function gauss m.m. Function used by pdisp.m. 
function p = gauss_m(t,T_0) 

7. Definition of Gaussian and super-Gaussian pulses 

m = 1; 7. m=l, usual Gauss; m=3, super-Gauss 

7.m = 3; 

p = exp(-t.~(2*m)/(2*T_0~(2*m))); 


Listing 6A.9 Function sslg.m. Function creates a Gaussian pulse in the time domain 
which is subsequently Fourier transformed to the frequency domain and then propagated 
over a linear system with dispersion characterized by ft and ft. 


7. File name: sslg.m 

7. Calculates and plots the evolution of a Gaussian pulse in optical fiber 
7. using Fast Fourier Transform split-step method with linear terms only 
clear all 

7. Input parameters 

N= 32; 7. number of points along time axis 

T_domain = 100; % total time domain kept [in ps] 

beta_2 = 2; 7o dispersion coefficient [in ps/nm-km] 

beta_3 = 1.01; 


Delta_t = T_domain/N; 

Delta_om = 2*pi/T_domain; 
t = Delta_t*(-N/2:1:(N/2)-l); 
omega = Delta,om*(-N/2:1:(N/2)-l); 


l node spacing in time 
I node spacing in radial frequency 
l array of time points 
i array of radial frequency points 
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t_FWHM_0 = 20; 

P_0 = 1; 

A_0 = sqrt(P_0); 

T_0 = t_FWHM_0/(2*sqrt(log(2))); 
gauss = A_0*exp(-t.~2/(2*T_0~2)); 

X 

L_D = 200; 

z_plot = [0 0.25 0.5 0.75 1.0]*L_D; 
gauss_F = fftshift(abs(fft(gauss))) 
z = 0; 
n = 0; 
hold on 

for z_val = z_plot 

X P -propagator function 

P = exp(((lj/2)*beta_2*(omega.* 

u_F_z = gauss_F.*P; 

u_z = ifft(u_F_z,N); 

u_abs_z = abs(u_z).~2; 

u = fftshift(u_abs_z); 

u_3D(:,n) = u’; 

plot(t,u,’LineWidth’,1.5) 

end 

grid on 

xlabel(’time [ps]’,’FontSize’,14); 
set(gca,’FontSize’,14); 
pause 
close all 

X 

X Make 3D plot 

for k = 1:1:length(z_plot) 

y = z_plot(k)*ones(size(t)); 
plot3(t,y,u_3D(:,k),’LineWidth’ 
hold on 

end 

xlabel(’time [ps]’,’FontSize’,14); 
set(gca,’FontSize’,14); 
grid on 

close all 


X initial pulse FWHM [in ps] 

X initial peak power [in mW] 

X initial amplitude of the Gaussian 
X initial pulse standard deviation 
X initial Gauss pulse in time 

X z-values to plot [in km] 

;X initial Gauss in frequency 
X starting distance 
X controls stepping 

X for selected z-values 
X creates new step 

2)+(lj/6)*beta_3*(omega.~3))*z_val); 
X propagation at point z 
X takes inverse Fourier transform 

X shifts frequency components 
X create array for 3D plot 


ylabel(’amplitude’,’FontSize’,14); 
X size of tick marks on both axes 


X choosing 3D plots every step 
X spread out along y-axis 
,1.5) 

ylabel(’distance [km]’,’FontSize’,14); 
X size of tick marks on both axes 
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7 Optical sources 

___ 


In this chapter, we will give a basic introduction to optical sources with the main emphasis on 
semiconductor lasers. The bulk of our description is based on the rate equations approach. 
We start with a general overview of lasers. 


7.1 Overview of lasers 


Generic laser structure is shown in Fig. 7.1 [1], [2], It consists of a resonator (cavity), 
here formed by two mirrors, and a gain medium where the amplification of electromagnetic 
radiation (light) takes place. A laser is an oscillator analogous to an oscillator in electronics. 
To form an oscillator, an amplifier (where gain is created) and feedback are needed. 
Feedback is provided by two mirrors which also confine light. One of the mirrors is 
partially transmitting which allows the light to escape from the device. There must be 
an external energy provided into the gain medium (a process known as pumping). Most 
popular (practical) pumping mechanisms are by optical or electrical means. 

Gain medium can be created in several ways. Conceptually, the simplest one is the 
collection of gas molecules. Such systems are known as gas lasers. In a gas laser one can 
regard the active medium effectively as an ensemble of absorption or amplification centers 
(e.g. like atoms or molecules) with only some electronic energy levels which couple to the 
resonant optical field. Other electronic states are used to excite or pump the system. The 
pumping process excites these molecules into a higher energy level. 

A schematic of a pumping cycle of a typical laser is shown in Fig. 7.2. The system 
consists of a ground state level (here labelled as ‘0’), two levels with energies E\ and E 2 and 
a band labelled as ‘3’. The system is pumped by an external source with the frequency &j 30 . 
Fast decay takes place between band ‘3’ and energy level ‘2’. In this way the population 
inversion is achieved (occupancy of level ‘2’ is larger than level ‘1’), and at some point 
laser action can take place and light with frequency a>2i can be emitted. 

The system consisting of level ‘2’ and ‘1’ can be further isolated. Popular visualization 
of such systems is known as a two-level system (TLS), see Fig. 7.3. Only two energies (out 
of many in the case of a molecule) are selected and the transitions are considered within 
those energies. As illustrated, three basic processes are possible: absorption, stimulated 
emission and spontaneous emission. 

Such TLS are met often in Nature. Generally, for an atomic system, in the case under 
consideration, we can always separate just two energy levels, upper level and ground state, 
thereby forming TLS. 
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Pump 
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Resonator (two mirrors) 


Generic laser structure: two mirrors with a gain medium in between. The two mirrors form a cavity, which confines the 
light and provides the optical feedback. One of the mirrors is partially transmitted and thus allows light to escape. The 
resulting laser light is directional, with a small spectral bandwidth. 


transitions 


Laser transitions 


Pumping cycle of a typical laser. 


hv>Eg 


absorption 

(a) 

Illustration of possible transitions in two-level system. 


stimulated 

emission 

(b) 



spontaneous 

emission 

(c) 


As mentioned above, an electron can be excited into the upper level due to external 
interactions (for lasers, process known as pumping). Electrons can lose their energies 
radiatively (emitting photons) or non-radiatively, say by collisions with phonons. 

For laser action to occur, the pumping process must produce population inversion, mean¬ 
ing that there are more molecules in the excited state (here, upper level with energy E 2 ) 
than in the ground state. If the population inversion is present in the cavity, incoming light 
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Illustration of the notation of electron transitions in a TLS. 


can be amplified by the system, see Fig. 7.3b where one incoming photon generates two 
photons at the output. 

The way how TLS is practically utilized results in various types of lasers; like gaseous, 
solid state, semiconductor. Also, different types of resonators are possible which will be 
discussed in subsequent sections. 

7.1.1 Transitions in a TLS 


Assume the existence of two energy levels E\ and E2 (forming TLS) which are occupied 
with probabilities N\ and N 2 . Also introduce: 

A 2 i the probability of spontaneous emission 
B 2 i the probability of stimulated emission 
B u the probability of absorption. 


The notation associated with those processes is illustrated in Fig. 7.4. 

We introduce V21 = (£2 — E\)/h and p(v 21) is the density of photons with frequency 
V21. Coefficients A 2 \ , B 2 \, B\ 2 are known as Einstein coefficients. The density of photons 
p(v21) of frequency V21 can be determined from the Planck’s distribution of the energy 
density in the black body radiation as [2] 


p(v21) = 


8; xhv\ x 1 

c 3 exp ^ — 1 


(7.1) 


For the laser we require that amplification be greater than absorption. Therefore, the 
number of stimulated transitions must be greater than the absorption transitions. Thus the 
net amplification can be created. In the following we will determine the condition for 
the net amplification in such systems [2], 

The change in time of the occupancy of the upper level is 


-7- = -^21^2 - B 2 \N 2 p(v2\) + B\ 2 N\p(v 2 \) ( 7 . 2 ) 

at 

First term on the right hand side describes spontaneous emission; second term is responsible 
for stimulated emission and the last one for absorption. In thermal equilibrium which 
requires that = 0, using Eq. (7.1) one obtains 

I Snhv^ 1 87 thv^ 

+ A «\ = W ' a 'V(expfr-1) 


(7.3) 
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Next, assume that N\ and Ni are given by the Maxwell-Boltzmann statistics; i.e. 



Substitute (7.4) into Eq. (7.3) and have 


(7.4) 




8 Jthvl, 




c 3 (exp |ff — 1) 


c 3 (exp |ff - 1) 


From the above equation, the density of photons p(vn) can be expressed in terms of 
Einstein coefficients as 


87r/rv|j An 

c 3 (exp |ff - l) #12 exp (Iff) - 7?2i 


(7.5) 


In the above formula, both sides are equal if 


Bn = B u 


In such case, one finds 


A 2 i _ 8jtAv|i 

£21 ~~ c 3 


The relations (7.6) and (7.7) are known as Einstein relations [2]. 


(7.6) 


(7.7) 


7.1.2 Laser oscillations and resonant modes 


Light propagation with amplification is illustrated in Fig. 7.5. Mathematically it is described 
by assuming that there is no phase change on reflection at either end (left and right). Left 
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end is defined as z = 0 and right end as z = L. At the right facet, the forward optical wave 
has a fraction r R reflected (amplitude reflection) and after reflection that fraction travels 
back (from right to left). 

In order to form a stable resonance, the amplitude and phase of the wave after a single 
round trip must match the amplitude and phase of the starting wave. At arbitrary point z 
inside the cavity, see Fig. 7.5, the forward wave is 

Eoe&e-j^ (7.8) 


where we have dropped e I0>t term which is common and defined g = g,„ — a m , where g,„ 
describes gain (amplification) of the wave and a m its losses. Also r R and r L are, respectively, 
right and left reflectivities, L length of the cavity and f) propagation constant. 

The wave travelling one full round will be 

{Eoe&e-jP*} .{ rRe gL e -jpL^ [r L e^e~^ z \ (7.9) 


The above terms are interpreted as follows. In the first bracket there is an original forward 
propagating wave which started at z, in the second bracket there is a wave travelling from z 
to L, the third bracket describes a wave propagating from z = L to z = 0, and the last one 
contains a wave travelling from z = 0 to the starting point z. At that point the wave must 
match the original wave as given by Eq. (7.8). From the above, one obtains a condition for 
stable oscillations 

r R ne 2gL e- 2 ^ L = 1 (7.10) 

That condition can be split into an amplitude condition 

r R r L e 2 ^- a " )L = 1 (7.11) 


and phase condition 


(7.12) 


From the amplitude condition one obtains 


From the phase condition, it follows 


(7.13) 


2 0L = 2 ;xn 


(7.14) 


where n is an integer. The last equation determines wavelengths of oscillations since 


lit 

’ ’ 


(7.15) 


with k„ being the wavelength. Typical gain spectrum and location of resonator modes are 
shown in Fig. 7.6a. Longitudinal modes with angular frequencies co„_i, co„ and co n+ i are 
shown. In time, the mode which has the largest gain will survive; the other modes will 
diminish, see Fig. 7.6b. 
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(b) 



Gain spectrum of semiconductor laser and location of longitudinal modes, i 
phase condition. 


, are FP resonances determined from 


output 

light 




7.2 Semiconductor lasers 


A significant percentage of today’s lasers are fabricated using semiconductor technology. 
Those devices are known as semiconductor lasers. Over the last 15 years or so, several 
excellent books describing different aspects and different types of semiconductor lasers 
have been published [3], [4], [5], [6], [7], 

The operation of semiconductor lasers as sources of electromagnetic radiation is based 
on the interaction between EM radiation and the electrons and holes in semiconductors. 
Typical semiconductor laser structures are shown in Fig. 7.7. Those are: vertical cavity 
surface emitting laser (VCSEL) [8] where light propagates perpendicularly to the main 
plane and in-plane laser where light propagates in the main plane [9]. The largest dimension 
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The basic p-n junction laser. 


of in-plane structures is typically in the range of 250 pm (longitudinal direction) whereas 
the typical diameter of VCSEL cylinder is about 10 pm. 

The basic semiconductor laser is just a p-n junction, see Fig. 7.8 where cross-section 
along lateral-transversal directions is shown. Current flows (holes onp-side and electrons on 
n-side) along the transversal direction, whereas light travels along the longitudinal direction 
and leaves device at one or both sides. 

In VCSEL, the cavity is formed by the so-called Bragg mirrors and an active region 
typically consists of several quantum well layers separated by barrier layers, see Fig. 7.7. 
(For a detailed discussion of quantum wells see [1] and [3], This discussion is beyond the 
scope of this book.) Bragg mirrors consist of several layers of different semiconductors 
which have different values of refractive index. Due to the Bragg reflection such structure 
shows a very large reflectivity (around 99.9%). Such large values are needed because a very 
short distance of propagation of light does not allow to build enough amplification when 
propagating between distributed mirrors. 

A three-dimensional perspective view of some generic semiconductor lasers is shown 
in Fig. 7.7. The structures consists of many layers of various materials, each engaged in a 
different role. Those layers are responsible for the efficient transport of electrons and holes 
from electrodes into an active region and for confinement of carriers and photons so they 
can strongly interact. Modern structures contain so-called quantum wells which form an 
active region and where conduction-valance band transitions are taking place. It is possible 
to have different types of mirrors as they also provide mode selectivity. Two basic types are 
illustrated in Fig. 7.9 and Fig. 7.10 [9], They are known as distributed feedback (DFB) and 
distributed Bragg (DBR) structures. 

In DFB lasers, grating (corrugation) is produced in one of the cladding layers, thus 
creating Bragg reflections at such periodic structure. The structure causes a wavelength 
sensitive feedback. It should be emphasized that the grating extends over the entire laser 
structure. When one restricts corrugation to the mirror regions only and leaves a flat active 
region in the middle, then the so-called DBR structure is created, see Fig. 7.10, which also 
provides wavelength sensitive feedback. 
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Basic DFB laser structure. 
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Basic DBR laser structure. 


DBR mirror DBR mirror 


Electron transitions between conduction and valence bands. 


1 Conduction 
band 

E Fc 


1 Valence 
band 


7.2.1 Electron transitions in semiconductors 


Previous discussion of TLS will now be extended to describe transitions between bands in 
semiconductors, see Fig. 7.11. For that purpose, we introduce: 

• f, the probability of the state of energy E v in the valence band being filled. 

• f c the probability of the state of energy E c in the conduction band being tilled. 

Rates of transitions can now be determined, similarly to the TLS description. Here we 
use index notation appropriate to semiconductors, i.e. c, v instead of 1,2. The rates are 


Sspon = S cv 

, S p= A 2l fcO ~fv) 

(7.16) 

S sti „ 

, = s cv 

= S 21 / c (1 - /„) p(E cv ) 

(7.17) 

Sab. 

, = s cv 

,abs = Bnfv (1 - fc) P(E CV ) 

(7.18) 
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E Fo 

E c 


E v 



P-type n-type 

(b) 


Energy-band diagram of a p-n junction, (a) thermal equilibrium, (b) forward bias. 


Fermi-Dirac statistics are assumed for the appropriate probabilities a 
1 


fM 


exp (— 


" -p(^ + l) 

Here Ep c and Ep v are the quasi-Fermi levels for electrons and holes. 
The condition for stimulated emission to exceed absorption is 


(7.19) 

(7.20) 


which gives 

Bnfc (1 - fv) > Bnf v (1 - fa) 

Since the coefficients £21 and £12 are equal, after substituting expressions (7.19) and (7.20) 
one obtains 



which can be written in the form 

Efc - E Fv >E C -E v = hv (7.21) 

The above inequality is known as Bemard-Duraffourg condition [10], 


7.2.2 Homogeneous p-n junction 


Energy-band diagrams of a p-n junction in thermal equilibrium and under forward bias 
conditions are shown in Fig. 7.12. 
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Electrons at the n-type side do not have enough energy to climb over the potential barrier 
to the left. Similar situation exists for holes in the p-type region, see Fig. 7. 12a. One needs 
to apply forward voltage to lower the potential barriers for both electrons and holes, see 
Fig. 7.12b. 

An application of the forward voltage also separates Fermi levels. Thus two different 
so-called quasi-Fermi levels E Fc and E Fv are created which are connected with an external 
bias voltage as 

E Fc - E Fv = eV bias (7.22) 

With the lowered potential barriers, electrons and holes can penetrate central region where 
they can recombine and produce photons. However, the confinement of both electrons and 
holes into the central region is very poor (there is no mechanism to confine those carriers). 
Also, there is no confinement of photons (light) into the region where electrons and holes 
recombine (the region is known as an active region). Therefore, the interaction between 
carriers and photons is weak, which makes homojunctions a very poor light source. One 
must, therefore, provide some mechanism which will confine both carriers and photons into 
the same physical region where they will strongly interact. Such a concept is possible with 
the invention of heterostructures, which will be discussed next. 

7.2.3 Heterostructures 


A heterojunction is formed by joining dissimilar semiconductors. The basic type is formed 
by two heterojunctions and it is known as a double-heterojunction (more popular name is 
double-heterostructure). 

The materials forming double heterostructures have different bandgap energies and 
different refractive indices. Therefore, in a natural way potential wells for both electrons 
and holes are created. Schematic energy-band diagrams of a double-heterostructure p-n 
junction in thermal equilibrium and under forward bias conditions are shown in Fig. 7.13. 
The bandgap and refractive index for InGaAsP material will now be summarized. 


Bandgap 

For the In\- x Ga x As y P\- y system, the relation between compositions x and y which results 
in the lattice-match to InP is 

°- 189 * (7.23) 

0.4184 — 0.013y v ’ 

In such case the bandgap is [11], [12] 

E gap [eV] = 1.35 - 0.12y + 0.12/ (7.24) 

with the extra relations y ~ 2.20.x and 0 < x < 0.47. The material for such compositions 
thus covers bandgaps in the range of wavelengths 0.92 pm <£> 1.65 pm. For example, the 
material /«o.746/00.26d.s'o.57^0.43 (i.e. x = 0.26, y = 0.57) has bandgap E g = 0.97eV, which 
corresponds to the wavelength A = 1.27 pm. 
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Table 7.1 Typical values of refractive indice 

is for various compositions. j 

Wavelength 

Range of y compositions 

Refractive index 

X = 1.3 pm 

0 <y < 0.6 

n(y) = 3.205 + 0.34y + 0.21/ 

X = 1.55 pm 

0 <y < 0.9 

n(y) = 3.166 + 0.26y + 0.09/ 




p-type Active n-type 


(b) 

Energy-band diagram of a double-heterostructure p-n junction, (a) Thermal equilibrium, (b) forward bias. 


Refractive index 

As was explained before, the dependence of refractive index as a function of wavelength 
is described by Sellmeier equation. For important telecommunication wavelengths, namely 
1.3 pm and 1.55 pm, they dependence of refractive index of In\- x Ga x As y P\- y that is lattice 
matched to InP is shown in Table 7.1 [13]. 

In the wells formed by heterostructures, electrons and holes recombine thus generating 
light. Also, differences in the values of refractive indices such that central region has 
larger value of refractive index create planar optical waveguide where light propagates and 
efficiently interacts with carriers. 

Some illustrative values of bandgaps and refractive indices for InGaAsP/InP heterostruc¬ 
ture are shown in Fig. 7.14. 


7.2.4 Optical gain 


For efficient and reliable numerical simulations, an exact mathematical expression of the 
material gain is critical. Such an expression should also be subject to an experimental 
verification. This problem has been investigated since the early developments of semicon¬ 
ductor lasers [14], [15], [16]. The first step is usually the determination of gain spectra 
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energy 


n-InP 

InGaAsP 

p-InP 


active 

1 




T —r - 

1.35eV 0-9|eV 


Refractive 



J|| Band structure and refractive index for InGaAsP/lnP heterostructure. 


and its comparison with experiment. The simplest approach will be provided in the next 
section. Typical curves of gain spectra for a four quantum well system and comparison 
with experimental measurements are shown in Fig. 7.15, from [17]. Mathematics involved 
in determining optical gain of semiconductor quantum-well structures is complex and has 
been discussed extensively, for example [18], [19], [20]. As a result, analytic approxima¬ 
tions of the optical gain which can be used in fast calculations were determined, see [21], 
[22], [23]. From an extensive discussion it was determined that the peak material gain for 
bulk materials varies linearly with carrier density 

g(7V, X) = a(X) min (TV, P) - b{X) (7.25) 

The parameter a (A.) is commonly called the differential gain. 

On the basis of experimental observations, Westbrook [24], [25] extended the linear gain 
peak model to allow for wavelength dependence. In its simplest form it can be written as 
[24], [26], [27] 

g(N, X) = a{X p )N - b{X p ) -b a (X- X p f (7.26) 

where X p is the wavelength of the peak gain and b a governs the base width of the gain 
spectrum. Wavelength peak X p can be also carrier density dependent. 

Gain (absorption) in a semiconductor is a function of carrier density n [cm -3 ] (see 
Fig. 7.16, which shows gain spectrum for several values of carrier concentration) [28], 
When n is below transparency density n tr , the medium absorbs optical signal. For n > n tr , 
optical gain in the material exceeds loss. The dependence of the so-called gain peak on the 
carrier density for quantum well systems is logarithmic, see Fig. 7.17. It is described by the 
expression [4] 

g = g 0 ln|- (7.27) 

For modelling purposes, a linear approximation is often employed. In the above formulas 
go is the differential gain, N tr transparency carrier density and N t h is the carrier density at 
threshold. 
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Fitted theoretical and experimental net modal gain versus photon energy including leakage terms for three values of 
injected current, from [17]. 



^ Schematic optical gain spectra for various carrier densities. 
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7.2.5 Determination of optical gain 


The simplest approach to determine optical gain is based on Fermi golden rule. Here we 
derive it for T = 0. Transition rate between two levels a and b is [1] 

Wa = y l H «*| 2 8 (Eb -Ea-hco) (7.28) 

where Y\' ah is the matrix element describing transitions between levels a and b. Assume 
parabolic bands for both electrons and holes with the effective mass m* taking the value of 
m* for conduction band and m* for a valence band as follows: 

h * 2 k 2 

E(k)=— (7.29) 

2m 

From the above 

E„-E a = — )+E g (7.30) 

2 \m c m v J 

Due to conservation of the crystal momentum in the transition, one has = kfi [29]. For 
a single transition at specific value of k, one therefore has 

W{k)=Y\H' V c(k) 2 \8^^+E g -hwj (7.31) 

where m r = is reduced effective mass. If A is a total number of transitions per second 

in crystal volume V and g(k) = is number of states per emit k in volume V, the total 
number of transitions per second is 

N — j W (k) ■ g (k) dk 


(7.32) 
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To evaluate integral, let us introduce new variable X = + E g — hco. Integral takes the 

form 

N = \H' C (k)\ 2 (X) y/^ (X + Juo- E g )dX 

V I n ^\2 ( 2m r ) 3/2 ( ,1/2 

= - \ H vc (k)\ ——( Hco-E g ) (7.33) 

where + E g = hco. We are in a position now to determine absorption coefficient a. It 
is defined as 


a (co) 


power absorbed per unit volume 
power crossing a unit area 


(N) ■ (hco)/V 
e 0 hE 2 c/2 


(7.34) 


where n is refractive index, c is the velocity of light in a vacuum and E 0 is the amplitude of 
electric field. Use 

Kc (*) = Xvc = (u vk ' M U ck ) (7.35) 

Using the above and expression for N, one finds 


a 0 (co) = 


coc^Xve ( 2 m r ) 3/2 
2neonch 3 


(hco-E g ) 1/2 


= K (hco — Egj 1/2 


(7.36) 


Example Estimates of K. 

Data for GaAs. hco = 1.5 eV, m v = 0.46mo, m c — 0.067 oto , Xvc = 3.2 A ,n = 3.64. 
Those data give K « 11 700 cm -1 (eU) -1/2 , and ao (co) =1170 cm -1 [1], 

At T = 0 depending on the value of hco, one has the following possibilities summarized 
next: 

hco < Eg a (co) = 0 

Eg < hco < E Fc — E Pv a (co) = —ao (co) — —K(hco — Eg) 1 / 2 amplification (7.37) 
E Fc — E Fv < Hco a (co) = ao (co) = K(hco — Eg) 1 / 2 absorption 


7.3 Rate equations 


With all basic elements in place we are now ready to provide the simplest (phenomenolog¬ 
ical) description of semiconductor lasers. It is based on rate equations [4], The main role 
in those devices is played by two subsystems: carriers (electrons and holes) and photons. 
They interact in the so-called active region, defined as part of the structure where recom¬ 
bining carriers contribute to useful gain and photon emission. We describe both subsystems 
separately, starting with carriers. 

In Fig. 7.18 (adapted from [4]), we schematically showed model of a laser operating 
below threshold. It resembles a tank partially filled with water continuously flowing in and 
at the same time water leaves the tank. The water models carriers which are continuously 
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Model of a laser operating below threshold. From L. A. Coldren and S. W. Corzine, Diode Lasers and Photonic Integrated 
Circuits, Wiley (1995). Reprinted with permission of John Wiley & Sons, Inc. 


leakage 
current 




Model of a laser operating above threshold. From L. A. Coldren and S. W. Corzine, Diode Lasers and Photonic Integrated 
Circuits, Wiley (1995). Reprinted with permission of John Wiley & Sons, Inc. 


provided by current flowing in. Not all the current at electrodes actually reaches the device 
(tank). Some of it is lost as so-called leakage current. Below threshold, carriers disappear 
through losses in the device (/?/«„), non-radiative recombination (R nr ) and spontaneous 
emission (R sp ). Above threshold (shown schematically in Fig. 7.19), the tank is completely 
filled with water (in laser the situation corresponds to the so-called threshold with density 
Nth)- Its operation is mostly dominated by yet another process, namely stimulated emission 
(here characterized by coefficient R st ). 

Next, based on the above picture we will formulate equations describing dynamics of 
carriers and photons. 


7.3.1 Carriers 


Inside the laser there exist two types of carriers: electrons and holes. Both are described in 
a similar way; however, parameters used will be different for both types. 

The rate of change of carrier density is governed by 


dN 

dt 


Ggener ~ Rrecom 


(7.38) 


where the terms on right are responsible for generation and recombination of carriers and 
are given by 



with V being the volume of the active region, q is the electron’s charge and / is the electric 
current. The internal efficiency % describes the fraction of terminal current that generates 
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carriers in the active region. The recombination term consists of several contributions 
Rrecom = Rsp + Rfir + Rl + R s t 
The meaning of terms is as follows: 

R sp — spontaneous recombination rate 
R nr — nonradiative recombination rate 
Ri — carrier leakage rate 
R st — net stimulated recombination rate 

Recombination processes are described phenomenologically as [4] 

Rrecom = + V g g{N)S (7.39) 


7.3.2 Photons 


Let S be the photon density. We postulate the following rate of change of photon density: 

f = ™»-| + r ^ 


with the following definitions: 


Tp - photon life-time 

P sp - spontaneous emission factor (reciprocal of the number of optical modes) 

R s , - stimulated recombination. 

T - confinement factor which is the ratio of the active layer volume to the volume of the 
optical mode. 


Consider growth of a photon’s density over the active region, (assume F = 1) 
S + AS = Se* Az 


where g is gain. 

if Az <?C 1 then exp (g ■ A z) sa 1 + gAz 

Using the relation Az = v g At (i^-group velocity), one finds S = Sg- v g - At. 
Thus, the generation term can be written as 


(§) 


") =*„ = = = 

dt) A< 


v g gS 


Finally, the rate equations used in this section are 


dN IN 

#=’H-r- 7 - v *WS 


(7.40) 

(7.41) 


We have explicitely indicated that gain g depends on a carrier’s concentration. 
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7.3.3 Rate equation parameters 


Other important parameters which in the simple model can be taken as constants, in fact 
have complicated dependencies. Those parameters are: 

1. the carrier’s lifetime r which strongly depends on carrier’s density. Typical dependence 
is shown below: 


1 


= A+BN+ CN 2 


where coefficient A describes non-radiative processes, B is responsible for spontaneous 
recombination and C describes non-radiative Auger recombinations. 

2. photon lifetime x p which is 

1 


n = r gtk 


Here a m describes mirror reflectivity 


In 


and a:, account for all losses. 

The meaning of all parameters appearing in rate equations and gain models is summarized 
in Table 7.2 along with typical values for those symbols. The parameters were collected 
from [30], [31] and [32], 

In the following, rate equations will provide a starting point for analysis of dynamical 
properties of semiconductor lasers. Before we start detailed analysis based on rate equations, 
we will establish a rate equation for electric field. 

7.3.4 Derivation of rate equation for electric field 


The relevant equation will be derived starting from the wave equation. We concentrate on 
Fabry-Perot lasers. The starting wave equation is 

V 2 E(r, 0 - ^^E(r, t) = Mo^yP(r, 0 (7.42) 

Assume the following decomposition: 

E(r, t) = a E,{x,y) sin (&z) £(/)<>' (7.43) 

where a is a unit vector describing polarization, fi z propagation constant in the z-direction. 
The transversal field E t (x, y ) obeys the following equation: 

/ 3 2 3 2 \ , 

\M + W) Et{X ' y) = - K ' EtM {1M) 

It is further assumed that E(t) is slowly varying compared to e 7 ®'. Substituting solution 
(7.43) into (7.42), neglecting fast-varying terms in time and using (7.44), one obtains 

~ Pi ~ ^ ^ - « 2 ^(t)J J a E t (x,y) sin (fi-z) e jat = ii Q ^P(r, t ) 
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j Table 7.2 Basic parameters appearing in rate equation approach and their typical values. ’ } 

Symbol 

Description 

Value and unit 

N 

carrier density 

cm- 3 

S 

photon density 

cm" 3 

I 

current 

mA 

<1 

elementary charge 

1.602 x 10“ I9 C 

L 

cavity length 

250 pm 

w 

width of active region 

2 pm 

d 

thickness of active region 

80A 

rii 

fraction of the injected current 



I into active region 

0.8 

V aclive 

volume of active region 

L-wd 

t 

carrier lifetime 

2.71ms 

y g 

group velocity 

c/n K f 

n m f 

refractive index 

3.4 

r p 

photon lifetime 

2.77 ps 

r 

confinement factor 

0.01 

Psp 

spontaneous emission factor 

10- 4 

a 

differential gain (linear model) 

5.34 x 10“ 16 cm 2 

Nfr 

carrier density at transparency 

3.77 x 10 18 cm" 3 

Ith 

current at threshold 

1.11mA 

dm 

facet loss 

45 cm" 1 

X ph 

laser wavelength 

1.3 pm 


To evaluate terms on the right hand side, we need to account for a nonlocal relation between 
polarization and susceptibility: 

P(r, t)=eo f X (r, t')E( r, t - (7.45) 


In the above E(r,t — t') is given by (7.43). As E (t) is slowly varying in time, we can expand 
it into Taylor series around t. The general formula for Taylor expansion of the function 
/( x ) around a is f(x) = f (a) + ^ (x — a). In our case, substituting x = t — t' and a — t, 
we obtain 


E(t -1') 


Full electric field is therefore 


E( r, t-t')= E t (x,y ) sin (fi z z)E(t - t r )e M ‘- t ' ) 


Substituting the last result and Taylor expansion for E(t—t') into expression for polarization 
(7.45), one finally obtains the wave equation for £(f): 


\-pl] + —s r {co)\E(t)- 


2jco 


de r {(o )'I 
dco \ 


m =0 


(7.46) 
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Here e r (r, co) = 1 + / (r, co) and Fourier transform of susceptibility is (we have dropped r 
dependence) 



Susceptibility is further separated into terms which account for various physical effects as 
follows: 

X («) = 1 + Xb + Xp~ JXloss 

where Xb = X* + jx't is due to background, Xp = X p + jXp is induced by pump and 
Xioss accounts for losses. The relative dielectric constant is written as (dropping argument 
dependencies) e r = e' r + js". The real part of s r is approximated in terms of refractive 
index s' r = («o + An p ) 2 ~ «q + 2h 0 A«p, where n 0 is the background refractive index and 
An p is the change induced by pump. Using the above results, s r takes the form 


e r = e' + js" « n\ + 2n Q An p + js" 



(7.47) 


Explicitly, one has the following identifications: 

1 + X' b = »o 

Xp = 2n 0 An p 
Xb + Xp ~ Xioss = K 


Using the above relations, the term in the equation for slowly varying amplitude E(t), 
Eq. (7.46) is 



(7.48) 


where we have used the approximate relation 



Imaginary parts of background and pump susceptibilities are related to gain via an experi¬ 
mental relation 


— U + x;) = r g (A0 

C«o ^ ' 


where g(N) is gain. Assume also 



With the last two relations, term given by Eq. (7.48) can be expressed as 

^ySrico) -Po = ——+ ^rlno&rip + 7 — [Vg(N) - 
c z c z c z c 
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Using n 2 (co) = e r (co), the dispersion term in (7.46) is evaluated as 


1 de r (co) _ 2 1 dn(co) 

s r (M) + -w— - = n (co) + ~(on(co) — - 

2 dco 2 dco 


m n(co) + con g (co) 


where we have defined group index n g (oj) a 


(7.49) 


(7.50) 


Using the results (7.48) and (7.49), Eq. (7.46) is 

dE(t) 
dt 


where we approximated « 2 - = (<u - o>o) (tu + coo) — 2co (co - ojo) and also ~ 1 

and used the relation f = v g . Our Eq. (7.51) is very similar to Eq. (6.2.9) of Agrawal and 
Dutta [9], 


7.4 Analysis based on rate equations 


Rate equations just introduced will now be used to study some dynamical properties of 
semiconductor lasers. We start with steady-state. 

7.4.1 Steady-state analysis 


In this situation one has no change in time; i.e. 

dN _ dS _ o 
dt dt 

Rate equations take the form 

*4-“- ^sms 0 = o (7.52) 

qV x 

and 

rv g g(No)S 0 -j- + r p sp R sp = 0 (7.53) 

If in the last equation we neglect the small spontaneous emission term, we obtain an 
expression for a photon life-time 

I 


- = fv g g(No) 


(7.54) 
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7.4.2 Small-signal analysis with the linear gain model 


When neglecting spontaneous emission and using linear gain model where gain is given by 
g= a (N - N tr ), one obtains 

^ = q~---v g a(N- N^) S (7.55) 

at qV x 

and 

^ = T v g a (N - N tr )S-— (7.56) 

at s xp 

Assume that all time-dependent quantities oscillate as 
/ = k + i{co)e M 
N = N 0 + n(a))e imt 
S = So + s(co )e'"“ 


where co is the angular frequency of an external (small) perturbation and 7o is the bias value 
of current. Here No and S 0 are the solutions in the steady-state. Substitute into the first rate 
equation and after multiplication and neglecting second-order term (n(co)s(a ;)) one obtains 

jcon(co)e jwt = [/ 0 + i(co)e ia,t ] - — - -n(co)e ia>t 

qV L J x x 

-v g a ((No - N tr ) So + (N 0 - N tr ) s((o)e ia,t +Son((o)e ia>t ) + 0(n 2 ) 

Using steady-state results, expression (7.54) for photon life-time and dropping e jo>t depen¬ 
dence, one finally obtains 

jcon(a)) = ^ ~ v g aS 0 n((o) (7.57) 

The second equation for photons is obtained in a similar way. Substituting small-signal 
expressions, neglecting the second-order term, using the photon life-time expression and 
finally dropping e ], '“ dependence, one obtains 


jcos(co) = r v g aSon(w) (7.58 

From the above equations, we want to determine modulation response which is defined a 
[33], [34] 

_ s(co) 


M(a>) = 


i(co) 


Expressing n(w) from Eq. (7.58) and substituting into (7.57), one obtains 
v g aS 0 


where D(co) is given by 


2 /I v g aS 0 

) = — u> +/&>(— + v g aSo J H-—— 
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Define response function as 


\s_m\ 
I Ko>) I 


^TvgdSo 

' |7)(oOI 


r{co)= | 

If we write D(a>) = a + jb, we have \D(co)\ 2 = a 2 + Therefore 

mo,)? = - ^) 2 + o? (i+**)’ 


We will see soon that the response function has a peak at frequency /r. To find that 
frequency, we evaluate first derivative a|J ^^ and set it to zero. Explicitly 


3|D( W )| 2 | „ ( VgaSo 

dco 2 L=«* Vi 


_ 2 (r + ^ a5 °) = 0 


From the above 

^2 = Vgf$o _ 1 Q + ( 7 . 59 ) 

which is known as a relaxation-oscillation frequency and it describes the rate at which energy 
is exchanged between photon system and carriers. The second term in (7.59) is usually very 
small and can be neglected. Relaxation-oscillation frequency is thus approximated as 


COR 


1 Sov g a 
t p 1 + eS 0 


(7.60) 


To estimate the typical values of the relaxation-oscillation frequency, let us assume [1]: 
L = 300 pm, t p h w 10 _12 5 and r ~ 4 x 10 -9 s. One finds AP 0 ~ 10 9 s _1 . So, the 
zeroth-order expression is a very good one. 

Using MATLAB code and parameters shown in Listings 7A. 1.1 and 7A. 1.2, one obtains 
the results shown in Fig. 7.20. 


7.4.3 Small-signal analysis with gain saturation 


Let us remind ourselves of the rate equations which were established in the previous section: 


dN _ I N 
dt T,t qV t 

<l j- = r v g g(N)S - 
dt 

Use small-signal assumptions 


- v g g(N)S 


S 


+ PspRsp 


(7.61) 

(7.62) 


7(0 = 7 0 + m 
N(t ) = JV 0 + «(0 
S(t) = So + s(t) 


Also, generalize the gain model to include saturation as follows: 

, go+g(N-N 0 ) 


1+eS 
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where g' = Jj| is differential gain evaluated at TV = Nq and go = g(No). The factor 
1 + eS introduces nonlinear gain saturation. The gain compression parameter e has a small 
value and the term eS is small compared with the one even at very high optical power. The 
effect of this term on dc properties is small and can be neglected. However, it significantly 
affects dynamics of a semiconductor laser. The reason is that the laser’s dynamics depends 
on the small difference between gain and cavity loss. This difference is only about a few 
percent. Therefore, even small gain compression due to e produces significant effect. 

Evaluate the gain compression term as follows: 

1 + = 1 + e (5o + 5 ( 0 ) = (1 + ego) ^1 + ^ + ^ ^ 

With the above result, gain can be approximated as 


gin) = 


go 

1 + eSo 


g'nfl) 

1 + cSo 


go esjt) 

1 + ego 1 + ego 


(7.63) 


Using the above results, stimulated emission term takes the form 


g(n)S = 


goSo 


g'Sp 


go go 


1 + eg 0 ' 1 + eS 0 n{t) (1 + eSo) 2 65(0 4 1 + eg 0 
As usual, the following analysis is separated into two parts: dc and ac analysis. 


5(0 + 0(5 2 ) (7.64) 


dc analysis 

With the assumption that Jp = 0 from rate equations, one obtains 


7o_ _ No _ gpgp _ 0 
^ qV x Vg 1 + eS 0 
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and 


Tv g 


goSo 

1 + eSo 


So 


+ PspRsp 


= 0 


Neglecting spontaneous emission (f} sp = 0), from the second equation one obtains an 
expression for the photon life-time r p : 


(7.65) 


ac analysis 

Substitute small-signal assumptions into rate equations, use an approximation for stim¬ 
ulated emission (7.64) and eliminate dc terms. The results are 


dn(t) 

dt 


i(t) n(t ) v g g'So 1 5b 1 

7z—rr-—w- n(t ) + —- —— —es(t) --s(f) 

qV x 1 + gSq XpT 1 + gSq x p V 


ds(t) 

dt 


TvggSo 

1 + eSp 


n(t) — 


So 

1 + eSo 


es(t ) 


In the above equations we used expression (7.65). Those equations can be written in matrix 
form: 



where 

J= 1 VgSpg' _1 C c= r VgSol D= 1 Sq c 

x 1 + eSo XpT x p T 1 + eSo 1 + eSo x p 1 + eSo 


Assuming harmonic time dependence of the form exp (Jcot), the matrix equation is 


i>-m b 1 r»«)i_ Ufi 
L -c >+ojL»(oJ [ 0 J 

Solving the above system, one obtains modulation response which is expressed 
s(t) _ 1 C 


where 


H(co) = (jco + A) (jco + D) + CB 


The function H(a>) can be written as 

H(co) = -co 2 + jcoy + co 2 r 

Here cor is the relaxation-oscillation frequency and y is the damping factor. One defines 
modulation response r{co) as [30] 


r(co) = 


H{ 0 ) 
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The effect of gain compression on modulation response. 

The plot of response r(co) is shown in Fig. 7.21 for three values of e. The effect of gain 
compression parameter e is clearly visible. MATLAB code used to obtain those results is 
provided in Listings 7A.2.1 and 7A.2.2. 

7.4.4 Large-signal analysis for QW lasers 


Large-signal analysis based on rate equations for semiconductor lasers has been reported 
extensively [35], [36], [37], Main equations without spontaneous emission are 

^ = m Jy - J ~ v g g ( N ) ■ s ( 7 - 66 ) 

^ = r-v g g(N)-S-y (7.67) 

All symbols have been explained previously and their values are summarized in Table 7.2. 

Typical results for step current are shown in Fig. 7.22. MATLAB code used to obtain 
those results is provided in Appendix, Listings 7A.3.1 and 7A.3.2. 

One can observe that when we neglect spontaneous emission term in the rate equation 
for photons, photon density tends to zero (steady-state value) as t —> oo, the result which 
is also evident from the steady-state analysis. 

7.4.5 Frequency chirping 


Frequency chirping arises during direct modulation of semiconductor laser when carrier 
density undergoes abrupt change. As a result, material gain also changes. This change 
results in a variation of the refractive index, which in turn affects the phase of the electric 
field. 
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Numerical solutions of large-signal rate equations after applying rectangular current pulse at t = 0. Normalized 
values of electron density N(t) and photon density S(t) are shown. 


The rate of change of the phase is 

d4> 1 , r , 

= 2 Va H °W 

An associated change in frequency is created which is described by 
1 d<f> 1 

Av(f) = — — = —va enh (rg-aioss) 

2jt dt An 

This change in frequency is called ‘chirp’. During direct modulation it results in a shift in 
frequency of the order of 10-20 GHz. 

To eliminate chirp effect, an external modulator should be used. In such cases a laser 
produces constant output power and a separate modulator provides modulation. 

Another way to reduce chirp is to fabricate devices with small values of line-width 
enhancement factors a#. 

dn r 

aH — -Jrr 

dv 

where n = n r + j% (complex refractive index). 


7.4.6 Equivalent circuit models 


Often in engineering practice there is a need to operate with circuit models, instead of more 
complicated physical-based models. Here we outline the method of constructing such a 
model for a semiconductor laser. 
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7.4.7 Equivalent circuit model for a bulk laser 


Equivalent circuit models of semiconductor lasers provide further understanding of the 
laser properties. They are derived from rate equations. Development of those models was 
initialized a long time ago [38], [39], [40]. More recent works which also include additional 
effects like carrier transport or proper representation of optical gain, are [41], [42], [43]. In 
the following, we outline the creation of equivalent circuit model for a bulk semiconductor 
laser following Kibar et al. [41], 

For bulk lasers, rate equations are [41], [9] 

= ( G-y)S + R sp (7.68) 

= - — y e N — G - S (7.69) 

q 

Here N, S are the total number of electrons inside the cavity and total number of photons in 
the lasing mode. Those are dimensionless quantities. We introduce small deviations from 
equilibrium as 


dS 

dt 

dN 

dt 


S(t ) = S 0 + SS(t ) 
N(t) = No + SN(t ) 


(7.70) 

(7.71) 


Optical gain is approximated as 


G = G 0 + ^SN + ^SS = G 0 + G n SN + G s SS (7.72) 


In the steady-state one obtains 


0 = (Go — y) So + R sp ,o 
0 = ^ - y e N 0 - Go ■ So 


Observing that spontaneous emission R sp = R sp (N) depends on N , and performing an 
expansion 

R sp = R sp ,o+ d ^fSN (7.73) 


one can derive small-signal equations 
d8S 
dt 
dSN 

~dT 


= — TsSS(Xj^SN 
= —VpfSN — cxgSS 


(7.74) 

(7.75) 


Is = — SoGs, r N = Ye + N 0 ~^ + S 0 G N 

dRsp 


on = SqGn + 


dN ’ 


crs = Go + SoGs 


(7.77) 
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Equivalent circuit model for a bulk laser. 


Using these equations, one creates equivalent circuit model. It is based on an observation 
that the small-signal rate equations are similar to the voltage and current equations of 
the RLC circuit. We therefore assume (following Kibar et al., [41]) that electrons in a 
laser cavity can be represented by the charge across a capacitor and the photons are 
represented by the magnetic flux of an inductor. Formally one introduces the following 
equivalence: 


8S(t) = ^ , 8N(t) = — 

q • unit q 

(7.78) 

where q is an electron’s charge, unit is a parameter in [ j] to ensure proper units for the 
components of the electrical circuit and 0(f) is the magnetic flux in [Wb], We also recall 
standard relations from electromagnetism: 

Q = Cv c 

(7.79) 

. dQ 

lc = lu 

(7.80) 

0(0 = L ■ 4 

(7.81) 

$ 1 * 

II 

(7.82) 


Now, we use (7.78) and relations (7.79)—(7.82) to convert small-signal equations (7.74) and 
(7.75) into voltage and current relations. After simple algebra, from (7.74) one finds 


vlO) = ~r s ■ L ■ i L (t) +on C ■ unit ■ vc (7.83) 


This equation can be identified with loop equation for the RLC circuit shown in Fig. 7.23. 
A loop equation gives 

vc = v L + v Rp (7.84) 


In order for the above equations to be identical, one makes the following identifications: 


Rs = 






unit ■ (Ty 


(7.85) 
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Similarly, from (7.75) one obtains 

ic(t) = -r N ■ C • uc(0 ——r ■ L ■ hit ) (7.86) 

unit 

That equation can be compared with the node equation from a circuit in Fig. 7.23: 

ic = —iR — ii (7.87) 

For those equations to be identical, one sets 

% = 1 , and L = (7.88) 

r N c a s 

One also has the following equivalence: 

C ■ Vr L ■ ir 

SN — -and SS = -— (7.89) 

q q • unit 

7.5 Problems 


1. Perform large-signal analysis for the log gain model described by Eq. (7.27). 

2. Analyse large-signal response to a Gaussian pulse. 

3. Based on rate equations for multimode operation, create the small-signal equivalent 
circuit model of a semiconductor laser. Draw the corresponding electrical circuit and 
derive equations which establish links between circuit parameters and rate equations. 
Hint: Consult the paper by Kibar et al. [41], 

4. Construct equivalent circuit model for single quantum well laser with carrier transport 
(follow Lau’s work in [42]). Introduce carrier density in an SCH region and allow for 
its interaction with densities in the quantum well. 


7.6 Project 


1. Conduct multimode analysis of a semiconductor laser. Formulate multimode rate equa¬ 
tions. Gain spectrum approximate as described by Coldren and Corzine [4], 


Appendix A: MATLAB listings 


In Table 7.3 we provide a list of MATLAB files created for Chapter 7 and a short description 
of each function. 







Appendix 7A 


1 Table 7.3 List of MATLAB functions for Chapter . 1 

Listing 

Function name 

Description 

7A.1.1 

small signal .m 

Small-signal analysis 

7 A. 1.2 

param-rate-eq-bulk.m 

Parameters for bulk active region in rate eqs. 

7A.2.1 

small epsilon.m 

Small-signal response with gain compression 

7A.2.2 

param rate eq QW.m 

Parameters for QW active region in rate eq. 

7A.3.1 

large signal.m 

Large-signal analysis 

7A.3.2 

eqsJarge.m 

Defines rate equations for large-signal analysis 


Listing 7A.1.1 Program smalLsignal.m. MATLAB program to perform small-signal anal¬ 
ysis for a quantum well semiconductor laser. 

7. File name: small_signal.m 
/ Purpose: 

7. Determines response function 
clear all 

param_rate_eq_bulk % input data 

7. 

7, loop over frequency in GHz 

N_max = 5000; 7. number of points for plot 

f_min = 0.01; f_max = 100; 

freq_GHz = linspace(f_min,f_max,N_max); 7. From 0.1 to 2 GHz 
semilogx(freq_GHz,0); 

1 

freq = freq_GHz*ld9; 7. convert frequency to 1/s 

omega = 2*pi*freq; 

1 

hold on 

for power_out = [ld-4 ld-3 0.01 0.1 1.0] 7. values of output power [W] 

7. Determine steady-state photon density from a given output power 
S_zero = 2*power_out/(v_g*alpha_m*h_Planck*freq_ph*V_p); 

1 

7. Construct denominator 

D = - omega.*2 + lj*omega*(l/tau +v_g*a*S_zero) + v_g*a*S_zero/tau_p; 

1 

response = (v_g*a*S_zero/tau_p)./abs(D); 
response_dB = 10*log(response); 

1 

h = semilogx(freq_GHz,response_dB); 
end 


xlabel(’frequency [GHz] ’,’ FontSize’,14); 
ylabel(’Response function [dB]’, 1 FontSize’,14); 


7, size of x label 
7. size of y label 
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set(gca,’FontSize’,14); size of tick marks on both axes 

axis([f_min f_max -20 35]); 

text(0.05, 7, ’10 \muW’,’Fontsize’,14) 

text(0.2, 18, ’1 mW’,’Fontsize’,14) 

text(0.6, 27, ’10 mW’,’Fontsize’,14) 

text(2.5, 33, ’0.1W’,’Fontsize’,14) 

text(8, 26, ’1W’,’Fontsize’,14) 

pause 

close all 


Listing 7A.1.2 Function param rate eq bulk.m. Function contains parameters for the rate 
equation model for the bulk active region. 


I File name: param_rate_eq_bulk.m 
7. Purpose: 

’/, Contains parameters for rate equation model for bulk 
’/, active region 
7, Source 

7 G.P. Agrawal and N.K. Dutta 
7. Long-wavelength semiconductor lasers 
7. Van Nostrand 1986 
7. Table 6.1, p. 227 
7. 

7. General constants 

c = 3dl0; 7. velocity of light [cm/s] 

q = 1.6021892d-19; 7. elementary charge [C] 

h_Planck = 6.626176d-34; 7. Planck constant [J s] 

hbar = h_Planck/(2.0*pi); 7. Dirac constant [J s] 

7. Geometrical dimensions bulk active region 

length = 250d-3; 7. cavity length [cm] ; 250 microns 

width = 2d-3; 7. active region width [cm] ; 2 microns 

thickness = 0.2d-3; 7. thickness of an active region [cm]; 0.2 microns 

volume_active = length*width*thickness; 7. volume of active region 

7. 

conf = 0.3; 7. confinement factor [dimensionless] 

V_p = volume_active/conf; 7. cavity volume [cm~3] 

ref_index = 3.4; 7. effective mode index 


v_g = c/ref_index; 
tau_p = 1.6d-12; 
tau = 2.2d-9; 

7. 


7. group velocity [cm/s] 

7. photon life-time [s] 

7. carrier life-time [2.71 ns] 


a = 2.5d-16; 7. differential gain (linear model) [cm~2] 

7. Parameters needed to determine output power 
alpha_m = 45; 7. mirror reflectivity [cm~-l] 

lambda_ph = 1.3d-3; 7. laser wavelength [microns]; 1.3 microns 

freq_ph = v_g/lambda_ph; 7. phonon frequency 
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Listing 7A.2.1 Function smalLepsilon.m. Driver program which determines small-signal 
response with gain compression for a single quantum well. For large-signal analysis based 
on rate equations. 

I File name: small_epsilon.m 
7. Purpose: 

7. Determines response function for quantum well with epsilon 
clear all 

param_rate_eq_QW “/, input data 


7. loop over frequency in GHz 
N_max = 100; 

f_min =0.1; f_max = 100; 

freq_GHz = linspace(f_min,f_max,N_max); 

semilogx(freq_GHz,0); 


7. number of points for plot 

7. From 0.1 to 2 GHz 

7. make axis and force log scale 


freq = freq_GHz*ld9; 7. convert frequency to 1/s 

omega = 2*pi*freq; 

power_out = 0.001; 7» output power (10 mW/facet) 

* 

hold on 

for epsilon = [0 1.5d-17 5d-17] i values of epsilon 

7. Determine steady-state photon density from a given output power 
S_zero = 2*power_out/(v_g*alpha_m*h_Planck*freq_ph*V_p); 


% Construct denominator 

A = 1/tau + v_g*S_zero*a/(l+epsilon*S_zero); 

B = l/(conf*tau_p) - (l/(conf*tau_p))*S_zero*epsilon/(l+epsilon*S_zero); 
C = conf*v_g*S_zero*a/(l+epsilon*S_zero); 

D = (l/tau_p)*S_zero*epsilon/(l+epsilon*S_zero); 
damping = A + D; 
omega_R2 = A*D + C*B; 


H = - omega.“2 + lj*omega.^damping + omega_R2; 

7. 

response = (A*D+C*B)./abs(H); 
response_dB = 10*log(response); 
h = plot(freq_GHz,response_dB); 

xlabel(’frequency [GHz] ’ , ’FontSize’, 14); 7. size of x label 

ylabel(’Response function [dB] ’FontSize14) ; 7. size of y label 

set(gca,’FontSize’,14); 7. size of tick marks on both axes 

axis([f_min f_max -20 20]); 

text(2, 13, ’\epsilon = 0’,’Fontsize’,14) 

text(2, 10, ’\epsilon = 1.5 \times 10~{-17}’,’Fontsize’,14) 

text(2, 7, ’\epsilon = 5 \times 10“{-17>’,’Fontsize’,14) 

pause 

close all 
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Listing 7A.2.2 Function param_rate_eq_QW.m. Function which contains rate equation 
parameters for a single quantum well. 


7, File name: param_rate_eq_QW.m 
7, Purpose: 

7. Contains parameters for rate equation model for QW active region 
Y, Source 

7. L.A. Coldren and S.W. Corzine, 

7. "Diode Lasers and Photonic Integrated Circuits", Wiley 1995. 

7. General constants 

7. velocity of light [cm/s] 

7. elementary charge [C] 

7. Planck constant [J s] 

7. Dirac constant [J s] 


c = 3dl0; 

q = 1.6021892d-19; 
h_Planck = 6.626176d-34; 
hbar = h_Planck/(2.0*pi); 
7. Geometrical dimensions 
7. QW active region 
L = 250d-4; 
w = 2d-4; 


7. cavity length [cm] ; 250 microns 

7. active region width [cm] ; 2 microns 

7. thickness of an active region [cm] ; 80 Angstroms 


7.V = L*w*d; 
V = 4d-12; 
1 


conf = 0.03; 

V_p = V/conf; 
ref_index = 4.2; 


7, volume of active region 

7. confinement factor [dimensionless] 
7. cavity volume [cm~3] 

7. effective mode index 


v_g = c/ref_index; 
tau_p = 2.77d-12; 
tau = 2.71d-9; 
beta_sp = 0.8d-4; 

1 

a = 5.34d-16; 

N_tr = 1.8dl8; 
eta_i = 0.8; 
epsilon = ld-17; 
current_th = l.lld-3; 


7. group velocity [cm/s] 

7. photon life-time [s] 

7. carrier life-time [s] 

7. spontaneous emission factor 

7. differential gain (linear model) [cm~2] 
7. carrier density at transparency [cnT-3] 


7. current at threshold [A]; 1.11 mA 


7. Parameters needed to determine output power 

alpha_m = 45; % mirror reflectivity [cm~-l] 

lambda_ph = 1.3d-3; 7. laser wavelength [microns]; 1.3 microns 

freq_ph = v_g/lambda_ph; 7« phonon frequency 
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Listing 7A.3.1 Function large_signal.m. Driver program for large-signal analysis based 
on rate equations. 

7. File name: large_signal.m 
7, Purpose: 

l Driver which controls computations for large signal rate equations 
l function z = large_signal 


clear all 

tspan = [0 2d-9]; l time interval, up to 2 ns 

yO = [0, 0]; l initial values of N and S 


[t,y] = ode45(’eqs_large’,tspan,yO); 
% 


t=t*ld9; 
y_max = max(y); 
yl = y_max(l); 
y2 = y_max(2); 

h = plot(t,y(:,1)/yl,’-.’, t, y(:,2)/y2); % 

set(h,’LineWidth’,1.5) ; 7. 

xlabelC’time [ns]’,’FontSize’,14); 7. size of 

ylabel (’ Arbitrary units ’ , ’ FontSize ’ , 14) ; 7. 

set (gca,’FontSize’, 14) ; 7. size 

legend(’carrier density’, ’photon density’) 7 
pause 
close all 


divided to normalize 
thickness of plotting lines 
x label 

size of y label 
3 of tick marks on both axes 
/. legend inside the plot 


Listing 7A.3.2 Function eqsJarge.m. Function defines rate equations equations for large- 
signal analysis. 

7, File name: eqs_large.m 
7. Purpose: 

7, Large signal rate equations are established 
7. N == y (1) 

7. S == y (2) 

7. assumed linear gain model 
function ydot = eqs_large(t,y) 

7. 

param_rate_eq_QW 7. input of needed parameters 

7. 

current = 3d-2; 7. bias current (step function) [A] ; 3 mA 

A = v_g*a*(y(l) - N_tr)/(l+epsilon*y(2)); 

ydot(1) = eta_i*current/(q*V) - y(l)/tau - A*y(2); 

ydot(2) = conf*A*y(2) - y(2)/tau_p + conf*beta_sp*y(l)/tau; 

ydot = ydot ’ ; 7. must return column vector 
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L 


As signals travel in optical communication systems, they are attenuated by optical fibre. 
Eventually, after some distance they can become too weak to be detected. One possible way 
to avoid this situation is to use optical amplifiers to increase the amplitude of the signal. 
An optical amplifier is an optical device for amplifying signals propagating over several 
channels in optical fibre to compensate for their loss during propagation. Signals also get 
distorted, say by presence of noise, and the amplification process does not clean them nor 
reshape them. The cleaning process is commonly known as regeneration and will not be 
discussed here. 

In this chapter we will concentrate on optical amplifiers. We review their general prop¬ 
erties and will also discuss erbium doped fibre amplifiers (EDFA). Semiconductor optical 
amplifiers (SOA), which are essentially semiconductor lasers without mirrors so they do 
not lase, will be discussed in the next chapter. 

Possible applications of optical amplifiers are illustrated in Fig. 8.1. Some of the basic 
applications are listed below: 

a) as in-line amplifiers for power boosting, 

b) as pre-amplifiers to increase the received power at the receiver, 

c) as power amplifiers to increase transmitted power, 

d) as a power booster in a local area network. 

The main types of optical amplifiers are: 

• doped fibre amplifiers 

• Raman and Brillouin amplifiers, and 

• semiconductor optical amplifiers. 

Issues which will be discussed in relation to those amplifiers are: gain and gain bandwidth, 
gain saturation, and noise and noise figure. 

In short, the operation and characteristics of all types of amplifiers are: 

1. population inversion is created, which means that more systems (atoms, molecules) are 
in a high energy state than in a lower one, 

2. the incoming pulses of signal induce stimulated emission, 

3. amplifiers saturate above a certain signal power, 

4. amplifiers add noise to the signal. 

In this book we will only describe two types of optical amplifiers: semiconductor optical 
amplifiers (SOA) and erbium doped fibre amplifiers (EDFA). Their general characteris¬ 
tics are compared in Table 8.1 where we summarize essential properties of EDFA and 
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Table 8.1 Some properties of EDFA and SOA. 1 

Properties of amplifiers 

EDFA 

SOA 

Active medium 

Er 3+ ion in silica 

Electron-hole 

in semiconductors 

Typical length 

Few metres 

500 pm 

Pumping 

Optical 

Electrical 

Gain spectrum 

1.5-1.6 pm 

1.3-1.5 pm 

Gain bandwidth 

24-35 nm 

100 nm 

Relaxation time 

0.1-1 ms 

<10-100 ps 

Maximum gain 

3-50 dB 

25-30 dB 

Saturation power 

>10 dBm 

0-10 dBm 

Crosstalk 

- 

For bit rate 10 GHz 

Polarization 

Insensitive 

Sensitive 

Noise figure 

3-4 dB 

6-8 dB 

Insertion loss 

<1 dB 

4-6 dB 

Optics 

Pump laser diode couplers, 

Antireflection coatings, 


fibre splice 

fibre-waveguide coupling 

Integration 

No 

Yes 



amplifier 



Possible applications of optical amplifiers: as an in-line amplifier (top) or as a preamplifier (bottom). TX stands for 
transmitter, RX for receiver. 


SOA, after Yariv and Yeh [1] (reproduced with permission). Next, we will discuss general 
characteristics of optical amplifiers. 

Some books on the subject are: Desurvire [2], Becker et al. [3] and Bjarklev [4], 


8.1 General properties 


Amplification in optical amplifiers is through stimulated emission (the same as in lasers). 
One can therefore consider similar characteristic parameters, like gain and its spectrum, 
bandwidth, etc. We will discuss them in some detail. 
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8.1.1 Gain spectrum and bandwidth 


To model gain, one typically starts with a homogeneously broadened two-level system. 
Local gain coefficient for such system is [5] 


g(v,P) 


go 



( 8 . 1 ) 


where go is the peak value of the unsaturated gain, vo atomic transition frequency, A vo 3-dB 
local gain bandwidth, P sat saturation power and P and v are optical power and frequency of 
the amplified signal. 

Local gain can also be written as 


g (co, P) = 


_go_ 

! + («»- co 0 ) 2 T 2 2 + P/P sat 


( 8 . 2 ) 


where co = 2jtv is the angular frequency and T 2 is known as dipole relaxation time [6], 

For small signal power one can consider a unsaturated regime defined as P <5C P sat . In 
this limit local gain is 


g(«) = 


go 

1 + (co - C0o) 2 T 2 


(8.3) 


The following conclusions can be derived from the above equation: 


1. maximum gain corresponds to transitions with angular frequency co = &> 0 , 

2. for co ^ coo gain spectrum is described by Lorentzian profile, 

3. local gain bandwidth, which is defined as the full width at half maximum (FWFIM), is 

A<w 0 = (8.4) 

h 

Local gain bandwidth Acoo is defined by points in frequency where local gain takes half the 
value at the maximum. In terms of frequency it can be written as 


Ay ° = = 3F 


(8.5) 


Let P (z) be the optical power at a distance z from the input end. Its change is described 


—^=g(v,P)-P(z) (8.6) 

dz 

Assume a linear device (here for power levels P <JC P sat ) where local gain is independent of 
the signal power. Integration of the above equation gives 

P(z)=P( 0)^ z (8.7) 


where P( 0) = P m is the signal input power. Linear amplifier gain is defined as 


G 


P{L) 
P( 0) 


( 8 . 8 ) 
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where P(L) = P out is the output power. From the above solution one obtains 


G=^W = e 
Pi 0 ) 


Amplifier bandwidth B 0 is evaluated using the above solution. It is defined by two frequency 
points where power drops by 50%, i.e. P^b = \Pmm(L), which translates into G^b — 
2 G max • Here G max is the maximum value of gain evaluated at v = vq. In detail 


exp 




2 


exp (g 0 ■ L) 


where we have introduced 3 — dB bandwidth Bo as Bo = v — vo. By the straightforward 
algebra, from the above relation one finds 


B 0 = 


goL — In 2 


( 8 . 10 ) 


Macroscopic bandwidth of the amplifier Bq is smaller than the local gain bandwidth Avo. 


8.1.2 Gain saturation 


We will now analyse the gain when signal power has large value and saturation effects are 
becoming important. Assume that w = co 0 . Substituting Eq. (8.2) into (8.6), one obtains 


= goP 

dz 1 + P/P sa 


( 8 . 11 ) 


We introduce new variable u = P/P sat and use separation of variables method to integrate 
the above equation. One obtains 




where u in = Pi„/P sa t, u a ut = Pout/Psat and P m , P oul are input and output powers, respectively. 
L is the length of an amplifier. Gain G is defined as 


( 8 . 12 ) 

One obtains 

G=G 0 exp(-^^) (8.13) 

where Go = exp (g 0 ■ L). 

In Fig. 8.2 we have plotted saturation gain dependence from Eq. (8.13). MATLAB code 
and its description are provided in Appendix, Listings 8 A. 1.1, 8A.1.2 and 8A.1.3. 
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Normalized output power, P ou /P sat 


Saturated normalized amplifier gain G/Gq as a function of the normalized output power for three values of the 
unsaturated amplifier gain Go. 


8.1.3 Amplifier noise 


Signal-to-noise ratio (SNR) of optical amplifiers is degraded because spontaneous emission 
adds to the signal during its amplification. Amplifier noise figure F„ is defined as [7] 


(, SNR) in 
(SNR) mit 


(8.14) 


SNR refers to the electrical power generated when the signal is converted to electrical 
current by using a photodetector. We model F n by considering an ideal detector limited only 
by a shot noise 

(, SNR) in = (8.15) 

a s 


where (/) = RP m is the average photocurrent, R = ^ is the responsivity of an ideal detector 
with unit quantum efficiency, aj = 2 q ( RP in ) A/ is the variance from shot noise and A/ is 
the detector bandwidth. At the output, we should add spontaneous emission to the receiver 
noise 


S sp (v) = {G- 1) n sp hv 


(8.16) 


Here S sp is the spectral density of the noise induced by spontaneous emission, v is optical 
frequency and n sp is the spontaneous-emission factor or population inversion factor. The 
value of n sp is n sp — 1 for amplifiers with complete population inversion (all atoms in the 
upper state) and n sp > 1 for incomplete population inversion. 

For a two-level system 


N 2 

N 2 -Ni 


(8.17) 
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Wavelength (nm) 


Absorption and gain spectra of Ge silicate amplifier fibre. Copyright (1991) IEEE. Reprinted, with permission from C. R. 
Giles and E. Desurvire, 7. Lightwave TechnoL, 9,271 (1991). 


where N\ and N 2 are the atomic populations in the lower and upper states, respectively. 
Total variance of the shot noise plus spontaneous emission noise is thus 

a 2 = 2q (RGP m ) A/ + 4 (GRP m ) (RS sp ) A/ (8.18) 


All other contributions to the receiver noise are neglected. At the output, the SNR of the 
amplified signal is 


(SNR) 0Ut = 


(Jf 

a 2 


{RGP in f _ GP ln 
V 2 * 4S sp Af 


(8.19) 


assuming G 1 and we neglected first term in (8.18). 
Using definition of F n , one finds 


F n = 2n sp -—- » 2n sp (8.20) 

G 

It shows that even for an ideal amplifier (n sp =1), amplified signal is degraded by a factor 
of 2 (3 dB). In practice, F n is in the range 6-8 dB. 


8.2 Erbium-doped fibre amplifiers (EDFA) 


Experimental data on absorption and gain of an EDFA whose core was codoped with Ge 
to increase the refractive index are shown in Fig. 8.3, after Giles and Desurvire [8], 

An optical fibre amplifier consists of an optical fibre where amplification takes place 
which is doped with a rare-earth element, and a pumping light supply system for supplying 
pumping light to the optical fibre for amplification, see Fig. 8.4. The pumping light supply 
system usually includes a semiconductor laser and an optical coupler for guiding the 
pumping light into the optical fibre for amplification. 
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Erbium fibre 


Signal ir 
1550 nm 


. CD , 


Pump Wavelength-selective 
couplers 


Isolator 



Signal o 




Schematic of EDFA. 


980 nm 




Non-radiative 


1530 nm 


1530 nm 


(a) (b) 

| Illustration of amplification in EDFA using a three-level model. 


(c) 


<5. 

i 


a s <p s 



Illustration of notation for a three-level system. 


Erbium-doped amplifiers are made by doping a segment of the fibre with erbium and then 
exciting the erbium atoms to a high energy level through the introduction of pumping light. 
The energy is transferred gradually to signal light passing through the fibre segment during 
excitation, resulting in an amplification of the signal light upon exit from the amplifier. 
Fibre optic amplifiers can amplify signal light including one or more wavelengths within a 
predetermined wavelength band without converting them into an electrical signal. 

Illustration of the amplification process in EDFA based on the three-level model is 
shown in Fig. 8.5 for the case of a 980 nm pump. A pump photon at a 980 nm wavelength 
is absorbed by an erbium ion in the ground state and jumps into the highest energy level. 
Then, through a non-radiative decay the ion loses its energy and arrives into a metastable 
state. Once there, a photon having wavelength of 1530 nm can force a stimulated transition 
of the ion into its ground state, creating one additional photon and an amplification. 

In this section, we will describe EDFA using a three-level model. We follow Becker 
[3]. The model is shown in Fig. 8.6. Level 1 is a ground state, level 2 is known as a 
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metastable one (it has a long lifetime) and level 3 is an intermediate state. The population 
of levels are introduced as M, M2 , M3 . The spontaneous transition rates of the ion (transition 
probabilities) which include radiative and also non-radiative contributions are denoted as 
T32 and P2i and correspond to transitions between levels 3 —» 2 and 2 —> 1, respectively, 
r t p is the pump absorption cross section and a s is the signal emission cross section. The 
incident light intensity fluxes of pump and signal are denoted by <p p and <p s , respectively. 
They are defined as number of photons per unit time per emit area. 

This three-level model represents energy level structure of Er + which is involved in the 
amplification process. To obtain amplification, we need a population inversion between 
levels 1 and 2. Here, we only consider a one-dimensional model where we assume that the 


pump and signal intensities and also distribution of Er ions are constant in the transverse 
direction. 

Based on the above observations, the rate equations for the changes of populations for 
all levels are postulated as 

^ m T 2 xN 2 - (Mi - M 3 ) a p cPp + (M 2 - Mi) oA 
at 

(8.21) 

^ = -r 2 iM 2 + r 32 M 3 - (M 2 - mo a s <t> s 

(8.22) 

d ^ = -r 32 N 3 + (N t -N 3 )a p <t> p 

(8.23) 

8.2.1 Steady-state analysis 


In the steady-state conditions 

dNi dN 2 dN 3 Q 

dt dt dt 

Also, total population M is assumed to be constant: 

(8.24) 

M = Mi +M2+M3 

(8.25) 

From Eq. (8.23) one obtains 


3$ 

11 

£ 

+ 

m 

(8.26) 


In what follows we will assume fast decay from level 3 to level 2; i.e. decay from level 
3 is dominant compared to pump rate. Mathematically, the assumption corresponds to the 
following condition 1^32 a p <p p . Lifetime of level 3,132 is related to transition probability 
as T32 = 1 /r 32 . In this limit there is almost no population of level 3, and therefore M3 sa 0. 
With those assumptions the system can be effectively considered as consisting of two levels 
only which are described by the following equations: 

(T p (j)p + <T s <p 

F21 T - a s (j) 

Ni+N 2 = N 


-Mi - N 2 = 0 


(8.27) 

(8.28) 
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Solutions by standard methods give for population inversion 

°P<l>p-rn 


r 2 i + 2 cr s (j) s + (T p (j) p 


(8.29) 


8.2.2 Effective two-level approach 


Keeping the above assumption, i.e. r 32 » o p (p p which allows us to neglect level 3, from 
Eq. (8.28) one has 

dN\ _ dN 2 
~df ~ ~~dt~ 

It is therefore enough to consider only one equation, say for N 2 ; the other population N\ 
can be found from N\ = N — N 2 . Following Pedersen et al. [9], to describe the system we 
postulate the following equations: 

^ = -T 2l N 2 + [c - <T S ( e) N 2 \ <t> s - [cr«A& - *, W M] <t> P (8.30) 

^ = r 2 iiV 2 + [a^N 2 - <T s (a) M] 4>, - \^ a) Ni - oplV 2 ] <t> P (8.31) 

where a^ a \ rrj e> , cx^ a \ (T p e> represent signal and pump cross sections for absorption and 
emission, respectively. Assume steady-state and from Eq. (8.30) determine N 2 : 


N 2 =N- 


where we have introduced r = 1 /r 2 i. Further, introduce signal I s and pump I p intensities 


where h is the Planck constant and ly, v p are frequencies of the signal and pump, respectively. 
This allows us to write 

r^/ s (z) + r %-I p (z) 
n 2 =N —---p-- 

T °* mT* X ° P h» p P 4( z ) + 1 

We further assume that N is independent of distance along fibre z. The variation of signal 
and pump intensities is described as 

dz L J 


= [a&N 2 - <rj a) fVi] I P {z) 








Gain characteristics of erbium-doped fibre amplifiers 


Table 8.2 Typical parameters of a low-noi: 

se in-line EDFA. 


Parameter 

Symbol 

Value 

Unit 

Signal mode area 

7tW 2 

1.3 x 10- 11 

m 2 

Erbium concentration 

N,ot 

5.4 x 10 24 

m -3 

Signal overlapping integral 

r s 

0.4 


Pump overlapping integral 


0.4 


Signal emission cross section 

Sse 

5.3 x 10“ 25 

m 2 

Signal absorption cross section 

S m 

3.5 x 10“ 25 

m 2 

Pump absorption cross section 

Sp 

3.2 x 10“ 25 

m 2 

Signal local saturation power 

P,, 

1.3 

mW 

Pump local saturation power 

P sp 

1.6 

mW 


8.3 Gain characteristics of erbium-doped fibre amplifiers 


We outline a basic approach to evaluate the performance of EDFA. Neglecting amplified 
spontaneous emission (ASE) and assuming copropagation configuration, the equations 
describing steady-state are 


2nT s {a se N me (z)I s (z) - cr sa N g Az)I s (z)\ 

(8.32) 

dIJz) 


= 2TzY p a sa N g Az)l p (z) 

(8.33) 


(8.34) 

,ot \+I p {z)/J sp + 2J s {z)/J ss 


N me (z) + Ngr(z ) = N tot 

(8.35) 


In the previous equations N gr (z) is the population of the ground state, N me (z) is the pop¬ 
ulation of the metastable state, T p (z) is the intensity of the pump wave propagating at the 
wavelength k p and I s (z) is the intensity of the signal wave propagating at the wavelength 
X s in the positive z-direction. 

Finally, the overall amplifier gain G is obtained using the following relation: 


where L is the doped fibre length. 


(8.36) 


8.3.1 Typical EDFA characteristics 


The above equations were solved numerically using parameter values from Table 8.2 [8], 
[10], [1 1], MATLAB codes are provided in Listings 8A.2.1-8A.4.2. 

The variation of gain with fibre length was determined first and the results are summarized 
in Fig. 8.7 for different values of pump power. MATLAB code is shown in the Appendix, 
Listings 8A.2.1-8A.2.4. Constant signal input power of 10 qW and constant erbium doping 
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density have been assumed. Gain was evaluated for four different pump power levels equal 
to 3, 5, 7 and 9 mW. As it is shown, gain increases up to a certain fibre’s length and then 
begins to decrease after reaching a maximum point. Similar results were reported in the 
literature, compare [10] and [11], 

The optimum fibre length (the one which corresponds to a maximum gain) is a few metres 
and it increases with the pump power. The reason for the decrease in gain is insufficient 
population inversion due to excessive pump depletion. 

In Fig. 8.8 we show the variation of gain with pump power for three different values of 
fibre lengths of 5, 10 and 15 m and a constant signal input power equal to 1 mW. MATLAB 
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code is shown in the Appendix, Listings 8A.3.1 and 8A.3.2. Constant Er doping density 
was also assumed. As it is seen, gain of the EDFA increases with the increasing pump 
power and then saturates after a certain level of pump power. The pump saturation effect 
occurs for input powers in the range 3-6 mW. Consult literature for more discussion of the 
obtained results [10] and [11]. 


8.4 Problems 


1. An EDFA has 20 dB of gain. If the power of incident signal is 150 |xW, what is the 
output power? 

2. An optical amplifier amplifies an incident signal of 1 p.W to 1 mW. Assuming the 
saturation power of 20 mW, what will be the output power of 100 p.W incident signal 
on the same amplifier? 

3. An optical amplifier produces power gain of 1000 over 30 m with a 10 mW pump, 
(a) What is the gain exponential coefficient a in nepers per metre? (b) What should 
be the length of that amplifier to produce gain of 1500? 

4. Using steady-state amplifier equations and appropriate approximations, derive 
an analytical expression for an optimum length of fibre which gives maximum 
amplification. 


8.5 Projects 


1. Perform a literature survey on EDFA with noise. Write a MATLAB program to simulate 
EDFA with realistic noise. 

2. Perform a literature survey on Raman amplifiers. Write MATLAB code to simulate 
Raman amplifier. Compare Raman amplifier and EDFA. 

3. Formulate transient model of EDFA with the saturation effect of the forward and back¬ 
ward amplified spontaneous emission [12], Integrate numerically the resulting equations. 
Analyse gain saturation and recovery times of an EDFA and the effects on the amplifi¬ 
cation of optical pulses. 


Appendix §A: MATLAB listings 


In Table 8.3 we provide a list of MATLAB files created for Chapter 8 and a short description 
of each function. 
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Table 8.3 List of MATLAB functions for Chapter . 

Listing 

Function name 

Description 

8A.1.1 

sat .gain, m 

Creates data for saturated gain 

8A.1.2 

f sat-gain.m 

Function used by sat-gain.m 

8 A. 1.3 

sat-plot.m 

Function plots saturated gain 

8A.2.1 

gain variable length.m 

Generates date for plots gain vs fibre 
length for fixed gain 

8A.2.2 

edfa-param.m 

Parameters for EDFA 

8A.2.3 

edfasqs.m 

Equations for EDFA 

8A.2.4 

variable length plot.m 

Plots gain using data generated by 
gain-variableJength.m 

8A.3.1 

gain-variable-pump.m 

Generates data for plots gain vs power 
for fixed fibre length 

8A.3.2 

variable-gain-plot.m 

Plots gain using data generated by 
gain-variable-pump.m 


Listing 8A.1.1 Function sat_gain.m. MATLAB program which generates data for satu¬ 
rated normalized amplifier gain Go- To generate the required data, it must be run three times 
for three values of Go- Therefore one should appropriately comment inputs for Go- Each 
time a different output file must be specified; i.e. the output file name should be changed. 

7. File name: sat_gain.m 

7. Generates data for plots of saturated gain G as a function of 
7, the output power for three values of the unsaturated amplifier gain G_0 
7. File must be run three times for three values of G_0 
clear all 
1 

G_0 = 10; 

7.G_0 = 100; 

7.G_0 = 1000; 
u_init = ld-2; 

Delta_u = 0.5d-l; 
u_final = ldl; 

yy = fzero(@(y) f_sat_gain(y, u_init, G_0), 0.5); 
for u = (u_init + Delta_u):Delta_u:u_final 
uu = [uu, u] ; 

y = fzero(@(y) f_sat_gain(y, u, G_0), 0.5); 

yy = [yy. y]; 

l 

d_u = length(uu); 

fid = fopen(’gain_data_10.dat’, ’wt’); 
for ii = l:d_u 


7. Open the file. 
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fprintf (fid, ’ %il.4f 7,11.4f\n’, uu(ii) ,yy(ii)) ; 7. Print the data 
status = fclose(fid); 7. Close the file. 

Listing 8A.1.2 Function f_sat_gain.m. MATLAB function used by a program which plots 
saturated gain. 

7. defines function used in plot of saturated gain of optical amplifier 
function fun = f_sat_gain(y, u, G_0) 
fun = y - exp(-(G_0*y-l)*u/(G_0*y)); 

Listing 8A. 1.3 Function sat plot.m. MATLAB function used to plot saturated gain from 
files. It was not possible here to create one MATLAB program which will perform all the 
calculations and also to plot all the results. Instead, we had to run sat gain.m three times 
for three different values of Go, output the results to data files and then plot final results 
using a separate program. 

7, File_name: sat_plot.m 

7, Plots graph of saturated gain using data generated by sat_gain.m 
clear all 

fid = fopen(’gain_data_10.dat 1 ); 

a_10 = fscanf (fid,’7,f 7,f’,[2 inf]); 7, It has two rows 
fclose(fid); 

7. 

fid = fopen(’gain_data_100.dat’); 

a_100 = fscanf (f id,’7,f 7,f’,[2 inf]); 7. It has two rows 
fclose(fid); 

7. 

fid = fopen( , gain_data_1000.dat ) ); 

a_1000 = fscanf (fid,’7,f 7,f 1 , [2 inf]); 7, It has two rows 
fclose(fid); 

7. 

semilogx(a_10(l,:),a_10(2,:),’k-’, a_100(l,:),a_100(2,:),’k--’,... 

a_1000(1,:),a_1000(2,?),»k-.’,’LineWidth 1 ,1.5) 

7. 

xlabel(’Normalized output power, P_{out}/P_{sat}','FontSize’,14); 
ylabel(’Normalized amplifier gain, G/G_0’,’FontSize’,14); 
legend(’G_0 = 10’,’G_0 = lOO’.'GJ) = 1000’) 

set(gca,’FontSize’,14); 7, size of tick marks on both axes 

pause 

close all 
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Listing 8A.2.1 Function gain_variable_length.m. MATLAB function used to compute 
gain versus fibre length for a fixed value of pump power. 

7. File name: gain_variable_length.m 

7. Computation of gain vs fibre length for fixed value of pump power 
7. User selects pump power which is controlled by P_p 
7. Calculations are repeated for P_p = 3,5,7,9 d-3 Watts 
7. User should also appropriately rename output file ’gain_P_3.dat’ 

7. based on Iannone-book, p.86 
7, P_s == y(l) signal power 
7. P_p == y(2) pump power 

7. 

clear all 
gain = 0.0; 
gain_temp = 0.0; 

P_s = 100d-7; 7. signal power [Watts] 

P_p = 9d-3; 7. CHANGE 7, pump power [Watts] 

7. 

for d_L = 0.01:0.01:10 
span = [0 d_L] ; 

y0 = [P_s P_p] ; 7. initial values of [P_s P_p] 

[z,y] = ode45(’edfa_eqs’ ,span,y0) ; 7. z - distance 

len = length(z); 

gain_temp = y(len)/y(l); 

gain = [gain, gain_temp]; 

end 

gain_log = log(gain); 
d_L_plot = 0.01:0.01:10; 
d_L_plot = [0, d_L_plot]; 

7, plot(d_L_plot, gain_log) 7. uncomment if you want to see plot when run 
7. axis ( [0 4 0 5]) 

7. pause 
7. close all 

7. 

d_u = length(d_L_plot); 

fid = fopen(’gain_P_9.dat’, ’wt’); 7. CHANGE 7. Open the file, 

for ii = l:d_u 

fprintf (fid, ’ 7.11.4f 7.11.4f\n’, d_L_plot(ii) ,gain_log(ii)) ; 
status = fclose(fid); 7. Close the file 
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Listing 8A.2.2 Function edfa.param.m. MATLAB function contains parameters for a 
model of EDFA. 


7. File name: edfa_param.m 
1 - 


7. Purpose: 

*/. Contains parameters for model of EDFA based c 
'/, Iannone-book 


l Table 3.2 


N_tot = 5.4d24; 
Gamma_s = 0.4; 
Gamma_p = 0.4; 
s_se = 5.3d-25 
s_sa = 3.5d-25 
s_p = 3.2d-25 
P_ss = 1.3d-3; 
P_sp = 1.6d-3; 


7. Erbium concentration (m~-3) 

% Signal overlapping integral (dimensionless) 
7. Pump overlapping integral (dimensionless) 

% Signal emission cross-section (m~2) 

7. Signal absorption cross-section (m~2) 

'/, Pump absorption cross-section 
*/, Signal local saturation power (W) 

7. Pump local saturation power (W) 


Listing 8A.2.3 Function edfa_eqs.m. MATLAB function contains equations for EDFA. 

function y_z = edfa_eqs(z,y) 

7. Purpose: 

7. To establish equations for EDFA following Iannone-book, p.86 
7. P_s == y(l) signal power 
7. P_p == y(2) pump power 

7. 

edfa_param 7o input of needed parameters 

num = y(l)/P_ss + y(2)/P_sp; 

denom = y(2)/P_sp + 2*y(l)/P_ss + 1.0; 

N_me = N_tot*num/denom; 

N_gr = N_tot - N_me; 

7. 

y_z(l) = 2*pi*Gamma_s*(s_se*N_me*y(l) - s_sa*N_gr*y(1)); 7o derivative 
y_z(2) = -2*pi*Gamma_p*s_p*N_gr*y(2) ; 

y_z = y_z’; 7. must return column vector 

end 


Listing 8A.2.4 Function variable_length_plot.m. MATLAB function used to plot graph 
of gain using data generated by gain-variable Jength.m. 

7, File_name: variable_length_plot .m 

7. Plots graph of gain using data generated by ’gain-variable-length.m’ 
clear all 

7. Open files for 4 values of length of device: 5 m,10 m,15 m 
fid = fopen(’gain_P_3.dat’); 

a_3 = f scanf (f id, ’7.f %f ’ , [2 inf]); "/, It has two rows 
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f close (fid) ; 

7. 

fid = fopen(’gain_P_5.dat’); 

a_5 = fscanf (fid,’7.f 7.f’,[2 inf]); 7. It has two rows 
f close (fid) ; 

7. 

fid = fopen(’gain_P_7.dat’); 

a_7 = fscanf (fid,’7.f 7.f’,[2 inf]); 7. It has two rows 
f close (fid) ; 

7. 

fid = fopen(’gain_P_9.dat’); 

a_9 = fscanf (fid, ; 7.f 7.f’,[2 inf]); 7. It has two rows 
f close (fid) ; 

1 

plot(a_3(1,:),a_3(2,:),’k-’, a_5(l,:),a_5(2,:),’k—’,... 

a_7(l,:),a_7(2,:),’k-.’,a_9(l,:),a_9(2,:),’k-’,’LineWidth’,1.5) 
axis( [0 4 0 10]) 

xlabel(’Doped fiber length (m)’,’FontSize’,14); 
ylabel(’Gain (a.u)’,’FontSize’,14); 
legend(’3 mW’,’5 mW’,’7 mW’,’9 mW’) 

set(gca,’FontSize’,14) ; '/, size of tick marks on both axes 

close all 


Listing 8A.3.1 Function gain variable pump.m. MATLAB function used to compute gain 
versus pump power for fixed value of fibre length. 

7. File name: gain_variable_pump.m 

7, Computation of gain vs power pump for fixed value of fiber length 
7. User selects fiber length which is controlled by d_L 
7. Calculations are repeated for d_L = 5,10,15 meters 
7. User should also appropriately rename output file ’gain_L_5.dat’ 

7. Based on Iannone-book, p.86 
7. P_s == y(l) signal power 
7. P_p == y(2) pump power 
1 

clear all 
gain = 0.0; 
gain_temp = 0.0; 

P_s = 100d-5; 7. signal power [Watts] 

d_L = 5; 7. CHANGE 7. fiber length [meters] 

span = [0 d_L] ; 7. region of integration [meters] 

for P_p = 0. ld-3:0. Id-3:10d-3 7. pump power [Watts] 

y0 = [P_s P_p] ; 7. initial values of [P_s P_p] 

[z,y] = ode45(’edfa_eqs’ ,span,yO) ; 7« z - distance 

len = length(z); 
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gain_temp = y(len)/y(l); 
gain = [gain, gain_temp]; 

end 

gain_log = log(gain); 

P_p_plot = 0.1:0.1:10; 

P_p_plot = [0, P_p_plot]; 

7. plot(P_p_plot, gain_log) "/, uncomment if you want to see plot when run 
7. axis ([0 10 0 30]) 

7. pause 
'/, close all 

7. 

d_u = length(P_p_plot); 

fid = fopen(’gain_L_5.dat’ , ’wt’); % CHANGE 7. Open the file, 

for ii = l:d_u 

fprintf (fid, ’ 7.11.4f 7.11.4f\n’, P_p_plot(ii) ,gain_log(ii)) ; 
status = fclose(fid); 7. Close the file 

Listing 8A.3.2 Function variable_pump_plot.m. MATLAB function used to plot graph of 
gain using data generated by gain -variable -pump.m. 

7. File_name: variable_pump_plot .m 

7. Plots graph of gain using data generated by , gain_variable_pump.m’ 
clear all 

7. Open files for 4 values of length of device: 5 m,10 m,15 m 
fid = fopen(’gain_L_5.dat’); 

a_5 = f scanf (f id, ’7.f 7.f ’ , [2 inf]); 7. It has two rows 
fclose(fid); 

7. 

fid = fopen(’gain_L_10.dat’); 

a_10 = fscanf (fid,’7.f 7.f’,[2 inf]); 7. It has two rows 
fclose(fid); 

7. 

fid = fopen(’gain_L_15.dat’); 

a_15 = f scanf (fid,’7.f 7.f’,[2 inf]); 7. It has two rows 
fclose(fid); 

7. 

plot(a_5(l,:),a_5(2,:),’k-’, a_10(l,:),a_10(2,:),>k—’,... 

a_15(l,:),a_15(2,:),’k-.’,’LineWidth’,1.5) 
axis ([2 8 0 20]) 

xlabeM’Pump power (mW)’,’FontSize’,14); 
ylabel(’Gain (a.u)’,’FontSize’,14); 
legend(’5 m’,’10 m’,’15 m’) 
set(gca,’FontSize’,14) ; 7. size 

close all 


of tick marks 


both 
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9 Semiconductor optical amplifiers (SOA) 

L._ 


The field of semiconductor optical amplifiers (SOA) is one of the fast growing in recent 
years. Many new applications of SOA were proposed. For comprehensive summaries, see 
books by Connolly [1] and Dutta and Wang [2], 

In this chapter we will discuss the following topics: 

• general concepts 

• amplifier equations 

• influence of cavity (FP amplifiers) 

• gain dependence on polarization and temperature 

• some applications. 

Typical use of SOA in an amplifier configuration is shown in Fig. 9.1. Here, a signal 
from transmission fibre is imputed on SOA where it is amplified. After amplification it is 
redirected again to the fibre. Coupling between fibre and SOA is provided by an appropriate 
optical element. 


9.1 General discussion 


SOA is very similar to a semiconductor laser. There are two categories of SOA (see Fig. 9.2): 
(a) Fabry-Perot (FP) amplifier and (b) travelling-wave amplifier (TWA). The FP amplifier 
displays high gain but has a non-uniform gain spectrum, whereas TWA has broadband gain 
but requires very low facet reflectivities. The FP amplifier has large reflectivities at both 
ends which results in resonant amplification, and also has large gain at the wavelength 
corresponding to longitudinal modes of the FP cavity. 

TWA has very small reflectivities, achieved by AR (anti-reflection) coating; its gain 
spectrum is broad but small ripples exist in gain spectrum, resulting from residual facet 
reflectivity. It is more suitable for system applications but the gain must be polarization 
independent. The phenomenological expression for gain of SOA is written as 

g m = a(n - n 0 ) - a 2 (X - X p ) 2 (9.1) 

and the wavelength peak value as 

X p = X 0 + a 3 (n — no) (9.2) 

Typical values of the parameters which appear in the above formulas are given in 
Table 9.1, from [3]. 
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j Table 9.1 Basic parameters of SOA. j 

Description 

Symbol 

Value 


Differential gain 

a 

2.7 x 

10 -16 cm 2 

Gain coefficient 

a 2 

0.15 c 

:m -1 nm -2 

Gain coefficient 

«3 

2.7 x 

10“ 17 nm ■ cm“ 3 

Transparency density 

no 

1.1 X 

10 18 cm“ 3 

Amplifier length 

L 

350 pm 

Density at threshold 

Mth 

1.8 x 

10 18 cm” 3 


Optical 

input^ 


Injection current (pump) 


(=i 


7 



/ cladding | 

■ — 



1 —) 


\- 

,"* Eopu “^ 

SOA 

| Transmission f 


5 

Amplified 


SOA in an amplifier configuration. 


Injection current (pump) 


Balk , 

semiconductor y 


Injection current (pump) 


"^\\\\\\\\\\\\\\\\\\\\\\\\\J 


Balk , 

semiconductor V7 


l\\\\\\\\\\\\\\\\\\\\\™ 


Optical 

Amplified input 
optical signal 
Cleaved s, § nal 



(a) 


Types of SOA: Fabry-Perot (left) and travelling-wave (right). 


(b) 


From the previous equations, the 3 dB gain bandwidth is determined as 


2AX = 2, 


i(n - rip) 
ln 2 


(9.3) 


Substituting typical values from Table 9.1 gives SOA bandwidth equal to 54 nm. 
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Basic model of a Fabry-Perot amplifier. 

9.1.1 Gain formula for SOA with facet reflectivities 


Consider a typical Fabry-Perot ethalon with a gain medium in-between, see Fig. 9.3. Total 
output electric field E out consists of all transmitted contributions: 

E m t =Ei+E 2 +E 3 -\ - 


Here R\ and R 2 are coefficients of internal reflections for electric field, and I — /?i and 
1 — i?2 corresponding coefficients of transmission. The single-pass gain G s for field is 

G s = e r(g - a)L (9.4) 


where F is optical confinement factor, g is gain coefficient and a are internal losses. The 
longitudinal propagation constant fi z is 


& = *o ■ 

- and n g effective group index 


(9.6) 


Here v g is the group velocity. Applying the standing wave condition to an electromagnetic 
wave of wavelength A. in a resonator of length L gives 


Expressions for transmitted components are 

E\ = E 0 e- Jf> * L y/G s y/l — A, yi — R 2 
E 2 = E 0 e~ jfl *- 3L (7^) 3 V1 - R ly fR 2 jR[J 1 - R 2 
E 3 = E 0 e- J ^ 5L (/C^ 5 v/l - R [ R 1 R ] j\ - R 2 
Total field, which is determined as a sum of all the above components, is therefore 
E out = E 03 /(l - Ri) (1 - R 2 ) G s e~ J%L { 1 + G s s/R^e~ J ^ 2L 
+ G%R 2 e~ jfi ’ AL + • • ■} 

Summing geometrical series, one finally obtains 

yi - R 2 G s E Q e-J^ L 


Eout = ~ 


1 - GjTEjhe- 1 ^* 1 
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'gain 



frequency 


Gain spectra showing frequency corresponding to gain peak. 


Finally, gain of SOA with facet reflectivities R\ and Rj is 
fl -R,)(l -fMGr 



(9.8) 


Phase shift (p is obtained by assuming that the phase of the incident wave is taken as zero. 
The relation for phase change cp at the output is thus 



(9.9) 


Using (9.6), we can write the above relation for frequency 



The above relation can be interpreted on gain spectrum graph as shown in Fig. 9.4 where 
frequency v 0 corresponds to gain peak G max , which is obtained when sin (p = 0, or (p = kn. 
Using an expression for phase (9.9), one finds its value corresponding to Vo (using relation 
lAo = Vo/Vg) 



or 



2 L 

assuming k= 1. The difference in frequencies v - v 0 is determined as 



or 



— Tt 


Finally 


2n (v — vq ) L 


(9.11) 


where we have neglected jt since shur =0. 
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Reflectivity (R) 

Gain ripple as a function of reflectivity for two values of gain. 


9.1.2 The effect of facet reflectivities 


An uncoated SOA has facet reflectivities (due to FP reflections) determined by taking the 
values of refractive indices of a typical semiconductor and the air and is approximately 
equal to 0.32. Even with the AR coatings, there are some residual reflectivities which result 
in the appearance of the so-called gain ripples, see Fig. 9.5. Ripples are superimposed on 
gain spectrum. The peak-to-valley ratio between the resonant and non-resonant gains is 
known as the amplifier gain ripple G r . From Eq. (9.8) one obtains 


G r = 



(9.12) 


where G s is the single-pass amplifier gain. 

For an ideal TWA both R\ andi?2 are zero. In this case G r = 1; i.e. no ripple occurs at the 
cavity mode frequencies. The quantity G r is plotted in Fig. 9.6 as a function of reflectivity 
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R (assuming that R\ = R2) for two values of gain. One observes that gain ripple increases 
with increasing gain and also increasing facet reflectivity. The MATLAB code is provided 
in Appendix, Listing 9A.1. 


9.2 SOA rate equations for pulse propagation 


In this section we concentrate on basic rate equations describing pulse propagation in 
SOA [2], [4], Discussion involves electromagnetic field and carriers. We start with the 
development of electromagnetic equation. 

Pulse propagation inside SOA is described by the wave equation 


V 2 E (r, t ) 


£ 3 2 E (r, t) 
c 2 3 1 2 


(9.13) 


where E (r, t ) is the electric field vector, c is the light velocity and s is the dielectric constant 
of the amplifier medium which is expressed as 


s = n 2 b + x 


(9.14) 


Here n b is the background refractive index of the semiconductor and susceptibility / which 
represents the effect of charges inside an active region in the phenomenological model is 


X = - (oiH + i)g(n) 

coo 


(9.15) 


where n is the effective mode index, olh is the linewidth enhancement factor (Henry factor) 
and g(n) is the optical gain approximated here (for bulk devices) as 


g{n) = a {n — n 0 ) 


(9.16) 


where a is defferential gain, n is the injected carrier density and «o is the carrier density at 
transparency. 

In the following we assume a travelling wave semiconductor wave amplifier which 
supports single mode propagation with a perpendicular electric profile described by F (x, y ). 
The electric field E (r, t) which obeys wave equation (9.13) is expressed as 

E (r, t) = n X - [F(x,y) A(z, t) (9.17) 

where n is the polarization vector, ko = na> 0 /c and A (z, t) is the slowly varying amplitude 
of the propagating wave. We introduce the following notation: 


v 2 = vi + 


3 z 2 ’ 


where V 2 = 


3 ? 


3 ^ 


and evaluate derivatives 


V 2 E 


(Vi F)Ae ik ° z +F^4e iko: + 2F^-ik 0 e ik ° z + FA(ik 0 ) 2 e^ z 
az z dz 


(9.18) 
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In the slowly varying envelope approximation (SVEA) the term with second derivative with 
respect to z and t is neglected. Substituting (9. 1 8) into (9.13), applying SVEA with respect 
to z and t and integrating over the transverse direction gives 


and 


VlF+^(n 2 b -n 2 )F = Q 


(9.19) 


dA 

dz 


1 dA 
v g 3 1 


ico 0 r 1 

= ~ UlossA 

2 cn 2 


(9.20) 


where we have accounted for losses described by a/ Q „. The group velocity is defined 
v g = c/n g and group index n g is 


3 n 

n g = n +(Do¬ 
do) 

The confinement factor F is 


I-^dxf^dy\F(x,y) | 2 


(9.21) 


(9.22) 


where w and d are the width and thickness of the amplifier active region. At this stage, one 
simplifies Eq. (9.20) by introducing transformation to a reference frame moving with pulse 
as 


y = z 


In the new reference frame Eq. (9.20) takes the form 


Rate equation for carrier density n is in the following form [2], [4]: 

dn I n Tg{n) 2 


dt qV 




(9.23) 


(9.24) 


where / is the injection current, V is the active volume, q is the electron charge, x c is the 
carrier lifetime and a m is the cross section of the active region. In the new reference frame, 
the above equation is 


dn = jL _ 1 _ r go,) 2 

dr qV r c hoooOm 


(9.25) 


Slowly varying amplitude A (z, t ) of the propagating wave is expressed 


A = sfPe^ 


(9.26) 
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where P(z, r) and <j>(z, r) are the instantaneous power and the phase of the propagating 
pulse. Using the above equations, one obtains [2] 


dA 1 

— = - (1 + ia H )g-A 

dg _ g- go _ gP_ 
dr r c E sat 

dP 

— = (g — aioss)P 
d<p 1 

^ 


(9.27) 

(9.28) 

(9.29) 

(9.30) 


The quantity E mt is defined as E sat = r C P S , where P s is the saturation power of the amplifier 


p _ hco 0 (T m 
s ~ dTr c 

In the above go is the small signal gain 



(9.31) 


(9.32) 


Finally, the cross section a m of the active region is a m = wd. 


Pulse amplification 

Using previously defined equations, we will now analyse pulse propagation assuming zero 
losses, i.e. «/„„ = 0, and also assuming that r p r c , where r p is the width of the input pulse. 
Under this approximation pulse is so short that gain has no time to recover. Observing that 
r c = 0.2-0.3 ns for typical SOA [4], this approximation works for r p equal to about 50 ps. 
Under the above approximations, one can obtain the analytical solution of the amplifier 
equations. One first integrates Eq. (9.29) to obtain output power as 

Poutir) = Pin{r)e h{x) = PUr)G(r) (9.33) 


where P ln (r) is the input power, and 


Hr) 


i: 


g(z, r )dz 


(9.34) 


Quantity h{r) is known as the total integrated net gain. Replacing the last term in Eq. (9.28) 
using (9.29) gives 

dg _ go~g _ ] _ dP 

dr r c E sat dz 

Integrating the above over amplifier length and using (9.33) gives 


dh{r) 

dr 


(9.35) 
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The solution of the previous equation is [2], [4] 


G(r) = e* w 


Go 

G 0 - (G 0 - 1) exp [-£ 0 (r)/£«,,] 


(9.36) 


where Go is the unsaturated gain of the amplifier and the quantity E 0 (x) is given by 


-L 


(9.37) 


E 0 {x ) represents the fraction of the pulse energy contained in the leading part of the pulse 
up to x' < x. 

The above solution shows that due to time dependence of the gain, different parts of 
input pulse experience different amplification which leads to a modification of pulse shape 
after being amplified by SOA. 

In the following we will restrict our analysis to the Gaussian input pulse 


Pin(r) =P 0 exp(-x 2 ) 


(9.38) 


For the Gaussian input pulse, the expression for the quantity E 0 (z) can be found in a closed 
form as 

E 0 (x ) = P 0 x 0 ^[1 + erf{ r)] (9.39) 

where erf (t) is the error function defined as 

erf(x) = 4L f e^dx (9.40) 

V 77 Jo 

We have also used the following property of error function [5]: 

1 — erf(x ) = ~^= f e~^dx (9.41) 

V* Jt 

In Fig. 9.7 we plotted pulse shape for several values of unsaturated gain Go- One can 
observe that the amplified pulse becomes asymmetric; i.e. its leading edge is sharper 
compared to its trailing edge. 


9.3 Design of SOA 


Here, we briefly describe the main effects which must be considered when designing SOA 
with proper characteristics. Out of many issues related to the design of SOA, we briefly 
concentrate here on two: suppression of cavity resonances and polarization insensitivity. 
Consult the literature [1], [2] and [6] for more information. 


Suppression of cavity resonance 

To fabricate a travelling wave SOA, the Fabry-Perot cavity resonances must be suppressed. 
To accomplish this, the reflectivities at both facets must be reduced. Three approaches were 
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used to achieve this goal (see Fig. 9.8): (a) to put anti-reflection (AR) coating at both facets, 
(b) to tilt the active region and (c) to use transparent window regions. 

The principles of AR coating were discussed in Chapter 3. AR coating only works for a 
particular wavelength and it is not suitable for a wide bandwidth. The analysis [7] shows 
that with appropriate combination of the previously discussed methods it is possible to 
obtain an effective facet reflectivity of less than 10 -4 . Multilayer coatings can broaden 
wavelength range where there is low reflectivity. 


Polarization insensitive structures 

The state of polarization of an electric field during propagation in optical fibre changes 
randomly. Therefore, after propagation when signal is ready for amplification (say, in 
SOA), its state of polarization is unknown. For amplifications of such signals it is therefore 
desirable that SOA has polarization independent amplification. The main factor responsible 
for polarization sensitivity is the difference between confinement factors for TE and TM 
modes [ 1 ]. Proper design of polarization insensitive SOA involves several techniques which 
are summarized in [1], 

Additionally, in modern SOAs which are based on quantum well (QW) designs instead of 
bulk structures, due to their significantly reduced threshold current and increased efficiency 
there exist additional polarization related effects associated with QW. QW significantly 
suffers from polarization sensitivity, that is, a significant difference in gain between the 
transverse-electric (TE) and transverse-magnetic (TM) polarization modes. This is a major 
concern as the polarization of a signal cannot be always controlled. Several approaches 
were discussed to the design of polarization insensitive SOA [8], [9], [10]. The issue is of 
significant importance for SOA built using quantum wells. 

















233 


Some applications of SOA 


Balk 

semiconductor 


Optical 

input 

signal 


Active region 



^\\\\\\\\\\\\\\\\\\V 


\ 


Amplified 

optical 

signal 


(a) 


AR coating 







Amplified 

optical 

signal 

Optical 

aiii 




(b) 
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9.4 Some applications of SOA 


9.4.1 Wavelength conversion 


One of the important applications of SOA is for wavelength conversion (WC) [11], see 
also [1], Three main methods of WC have been analysed: cross-gain modulation (XGM), 
cross-phase modulation (XPM) and four-wave mixing (FWM). XGM will be discussed in 
detail in the following section. 

Other methods (not discussed here) include those based on the nonlinear optical loop 
mirror (NOLM) with the nonlinearity achieved by using fibre or SOA (see recent review 
[ 12 ])- 
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laser probe X cw 

i_n_n_ h 


SOA 


[ Wavelength conversion using XGM in SOA in copropagation configuration. 

signal —I I—I I— X s - 

output 1 |—| |— X, cw < 


SOA 


- Xcw 

laser probe 


Wavelength conversion using XGM in SOA in a counter-propagation configuration. 


Converted signal, Xcw 

TIU1T 


Input signal power (dBm) 

£ 


Principle of wavelength conversion in SOA using XGM. 




Cross-gain modulation 

The reduction of gain, known as gain saturation, typically occurs for input powers of 
the order of 100 pW or higher. To understand this effect, one must remember that the 
amplification in SOA is the result of stimulated emission. The rate of stimulated emission 
in turn depends on the optical input power. At high optical inj ection, the carrier concentration 
in the active region is depleted through stimulated emission to such an extent that the gain 
of SOA is reduced. 

WC based on XGM can operate in either copropagation or counter-propagation config¬ 
urations. Counter-propagation configuration does not require optical filtering of the target 
wavelength. However, counter-propagation suffers from speed limitations [13]. 

The schematic representation of the XGM used as wavelength conversion is shown in 
Fig. 9.9 (copropagation configuration) and in Fig. 9.10 (counter-propagation configuration). 
The principle of wavelength conversion employing nonlinear characteristics of SOA is 
explained in Fig. 9.11. Two optical signals enter a single SOA. One of the signals (known 
as the probe beam) at wavelength X cw is injected continuously (CW); the other (known as 
the signal beam) injected at wavelength X s is carrying digital information. 
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If the peak optical power in the modulated signal is near the saturation power of the SOA, 
the gain will be modulated synchronously with the power. When the data signal is at high 
level (a logical ONE), the gain is depleted and vice versa. This gain modulation is imposed 
on the unmodulated (CW) input probe beam. Thus, an inverted replica of the input data is 
created at the probe beam wavelength. This form of wavelength conversion is one of the 
simplest all-optical wavelength conversion mechanisms available today. 

Very fast wavelength conversion can be achieved with speeds in the order of 40 Gb s - * 
[14] with a small bit error ratio penalty. Previously, it was thought that the speed of WC is 
limited by the intrinsic carrier lifetime, which is around 0.5 ns. This is because the effective 
carrier lifetime can be decreased by the use of high optical injection to values as low as 
10 ps. 

Another effect associated with pulse propagation in long SOA (>1 mm) is that pulse 
distortion of the input data due to gain saturation effects can tend to sharpen the leading 
edge of the data pulse at the probe wavelength. 


9.4.2 All-optical logic based on interferometric principles 


For the future high-speed optical networks, several all-optical signal processing functional¬ 
ities will be required to avoid cumbersome and power consuming electro-optic conversion. 
The central role in this process is played by all-optical high-speed logic gates. 

In recent years several schemes of implementation have been investigated. Some of those 
schemes exploit gain saturation of SOA, and the other methods employ the interferometric 
configurations. 

Optical logic devices are constructed using the Mach-Zehnder (MZI) interferometer, see 
Fig. 9. 12for an introduction to the notation and symbols. A <j> is the phase difference between 
signals propagating in the upper and lower arms of MZI. Each arm of the interferometer 
contains SOA where the effect of cross-phase modulation (XPM) is utilized to change phase 
of the transmitted light. The XPM effect is based on the physical effect of the dependence 
of refractive index on the carrier density in the active region. Depending on operating 
conditions which are controlled by driving currents within each SOA and direction and 
intensity of external light, this configuration can perform like an all-optical gain. In what 
follows, for several logic gates we schematically show operating conditions of MZI, the 
resulting phase difference and the equivalent logical table [15]. 
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A, | |— v OOO 

—I Acp = 0 o 1 0 

I-1 0 1 


A block diagram of A NOT B and a truth table. 




0 

0 

0 


A block diagram of A AND B and a truth table. 


A and NOTB 

Here, bias conditions are set in such way that, when a second signal B is input in a counter- 
propagating scheme, the phase difference is zero. The configuration and logical table are 
shown in Fig. 9.13. 


AANDB 

Here, both SOA are biased such that in the absence of counterpropagating signal B, a phase 
difference of n exists at the output X. The relevant configuration and the truth table are 
shown in Fig. 9.14. 


9.5 Problem 


1. Derive an expression for gain ripple G r . 


9.6 Project 


1. Use software developed by Connelly [1] to analyse optical logical elements. 


Appendix 9A: MATLAB listings 


In Table 9.2 we provide a list of MATLAB files created for Chapter 9 and a short description 
of each function. 
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Table 9.2 List of MATLAB functions for Chapter . j 

Listing 

Function name 

Description 

9A.1 

gainj-ipple.m 

Determines gain ripples as a function of reflectivity 

9A.1 

pulse jshape.m 

Determines output pulse shape for various values of G 0 


Listing 9A.1 Program gain ripple.m. MATLAB program which determines gain ripples 
for SOA. 

7. File name: gain_ripple .m 

•/. Calculates gain ripple as a function of reflectivity- 
clear all 

N_max = 401; % number of points for plot 

R = linspace(ld-4,4d-3,N_max); % creation of theta arguments 

7. 

for G_s = [200 500] 7. gain coefficient 

G_r = (1 + G_s.*R)./(1 - G_s.*R); 
loglog(R,G_r,’LineWidth’,1.5) 
hold on 

end 

7. Redefine figure properties 
ylabel(’Gain ripple’,’FontSize’,14) 
xlabel(’Reflectivity (R)’,’FontSize’,14) 

set(gca,’FontSize’,14); % size of tick marks on both axes 

text(3d-3, 12, ’G_s = 200’,’FontSize’,14) 

text(6d-4, 12, ’G_s = 500’,’FontSize’,14) 

text(8.5, 0.1, ’r = 0.8’,’Fontsize’,14) 

pause 

close all 


Listing 9A.2 Program pulse_shape.m. MATLAB program which determines output pulse 
shape for three values of the unsaturated gain Go. 

7 , File name: pulse_shape.m 
7o Determines output pulse shapes in soa 

7o Basic equations after Agrawal and Olsson, JQE25, 2297 (1989) 
clear all 
P_0 = 1; 

w = 0.1; 7. defined as P_0*tau_0/E_sat 

tau = -3:0.01:3; 
hold on 

for G_0 = [10 100 1000] 

x = (1/2)*sqrt(pi)*w*(1 + erf(tau)); 
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G = G_0./(G_0 -(G_0-1)*exp(-x)); 
y = exp(-tau."2).*G; 
y_max = max(y); 

plot(tau,y/y_max,’LineWidth’,1.5) 

end 

xlabel(’\tau’,’FontSize’,14); 

ylabel(’Normalized output power’,’FontSize’,14); 

set(gca, ’FontSize’ ,14) ; °/ 0 size of tick marks on both axes 

grid on 
pause 
close all 
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In this chapter we summarize the operation of an optical receiver, which is an important 
part of an optical communication system. An overview of design principles for receivers 
used in optical communication systems is provided by Alexander [ 1 ]. The aim of a receiver 
is the recovery of the transmitted data. The process involves two steps [2]: 

1. the recovery of the bit clock, 

2. the recovery of the transmitted bit within each bit interval. 

A block diagram of an optical receiver is shown in Fig. 10.1 [3], [4], 

The receiver consists of a photodetector which converts the optical signal into electrical 
current. A good light detector should generate a large photocurrent at a given incident light 
power. They should also respond fast to the input changes and add minimal noise to the 
output signal. This last requirement is of crucial importance since the received signal is 
typically very weak. In digital optical communication systems the detection process is often 
conducted with a PIN photodiode. 

There are generally two types of detection [4]: direct detection (also called incoherent 
detection) and coherent detection. 

Direct detection detects only the intensity of the incident light. It is used mainly for 
intensity or amplitude modulation schemes. It can only detect an amplitude modulated 
(AM) signal. 

Coherent detection can detect both the power and phase of the incident light. It is therefore 
used when phase modulation (PM) or frequency modulation (FM) is preferred. Coherent 
detection is also important in applications such as WDM. 

The coherent detection requires a local oscillator to coherently down-convert the modu¬ 
lated signal from optical frequency to intermediate frequency (IF) [4], Coherent detection 
is not discussed here, see [4] for the extensive discussion. The incoherent detection which 
dominates in currently deployed systems is based on square-law envelope detection of the 
optical signals. 

After detection, the electrical current is often amplified (not shown in Fig. 10.1) and 
then passes through an electrical filter which is normally of the Bessel type. At that point, 
electrical eye diagrams are typically observed for the assessment of signal quality. Next, 
sampling of electrically filtered received signal is performed. The received electrical signal 
is corrupted with noise of various origins. The noise sources will be discussed in the 
following sections. 

Performance evaluation of an optical transmission system is done by evaluating optical 
signal-to-noise ratio, eye opening and bit error rate (BER) which is the ultimate indicator 
of the system’s performance. 
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Input Output 



Block diagram of an ideal optical receiver. 


10.1 Main characteristics 


Before we enter a more specialized discussion of optical receivers, let us summarize their 
main characteristics [5], which are: receiver sensitivity, dynamic range, bit-rate transparency 
and bit-pattern independency. 


10.1.1 Receiver sensitivity 


This property is a measure of the minimum level of optical power P sem at the receiver 
required for a reliable operation. Specifically, one expects that the BER is smaller than a 
specified level. Typically, that level is established to be equal to 10 -9 . 

The receiver sensitivity is a function of both the signal and also the noise parameters 
of photodetector and a preamplifier. It is a measure of the operating limit of the optical 
receiver, which, however, rarely operates close to that limit. There always exists a possibility 
of degradation of the system (temperature, ageing, etc.), so a typical margin (normally 3- 
6 dB) is established. 

Receiver sensitivity is a fundamental parameter of an optical receiver. It is directly 
responsible for the spacing between two points in any optical link, e.g. between transmitter 
and receiver or between repeaters. 


10.1.2 Dynamic range 


Dynamic range (expressed in dB) is the difference between the maximum allowable power 
and minimum power determined by receiver sensitivity. The maximum allowable power on 
the receiver is determined by nonlinearity and saturation. 

Large dynamic range is important because it allows for more flexibility in the design of 
an optical network. The design of every network should take into account the wide range 
of possible changes of received optical powers due to changes in temperature, ageing or 
various types of losses (in the fibre, connectors, etc.). 


10.1.3 Bit-rate transparency 


It refers to the ability of the optical receiver to operate over a range of bit rates. It describes 
the ability of the same receiver to be used for several networks operating at different bit 
rates. 
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10.1.4 Bit-pattern independency 


This is the property of an optical receiver determining its operation for various data formats. 
The main constraint is imposed by non-retum-to-zero (NRZ) code. 


10.2 Photodetectors 


At the end of travel through a optical fibre, the optical signal reaches a photodetector (PD) 
where it is converted into an electrical signal. In PD photon of energy hv is absorbed and 
produces photocurrent i/>: 


ip 


( 10 . 1 ) 


where P is the power of the incoming light, rj is the quantum efficiency, h is Planck’s constant 
and v = c/X is the frequency of the absorbed light. PD is similar to a semiconductor diode 
polarized in the reverse direction. Therefore, PD can be modelled as a current source. 

Another very popular detector is a human eye. It is the most popular natural detector of 
light but it has several disadvantages: it is slow, has bad sensitivity for low-level signals, 
has no natural connection to electronic amplifiers and its spectral response is limited to the 
0.4-0.7 pm range. 

Artificial (human made) optical detectors are based on two physical mechanisms: external 
photoelectric effect and internal photoelectric effect. In the external photoelectric effect, 
electrons are removed from the metal surface of electrode known as a cathode by absorbing 
energy from incident light. Then, under an electric field due to the potential difference 
between both electrodes, they travel to another electrode known as an anode, thus producing 
electrical current. Vacuum photodiodes and photomultiplier tubes operate on that principle. 

Main choices for photodetectors are the PIN (P-type Intrinsic N-type) photodiode and 
avalanche photodiode (APD). APD provides gain which increases system sensitivity but 
introduces more noise. 

In this section we will discuss only the internal photoelectric effect and its applications in 
light detection. In this effect, physical processes take place inside semiconductor junction 
devices. There, free carriers (electrons and holes) are generated by absorption of incoming 
photons, and as a result an electrical current is produced. These devices can be viewed as 
the inverse of a light emitting diode (LED). 


10.2.1 Principles of photo detection 


Photodetection using semiconductors is possible because of optical absorption. When light 
is incident on the semiconductor surface, it may or may not be absorbed depending on a 
wavelength. Absorbed optical power is described as 


dP 

dx 


a(X)P 


( 10 . 2 ) 
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Wavelength X (|_lm) 


Optical absorption coefficient as a function of wavelength for typical semiconductors. Reprinted from H. Melchior, 
Journal of Luminescence, 7,390 (1973). Copyright (1973), with permission from Elsevier. 


where a (A.) is the absorption coefficient which is wavelength dependent. The spectra of 
optical absorption for several semiconductors (including compounds) are schematically 
shown in Fig. 10.2. As can be seen, the absorption coefficient a (A.) strongly depends on 
wavelength. 

Integration of Eq. (10.2) gives the absorbed power at the distance x from the surface in 
terms of the incident optical power Pq: 

P{x) = P 0 (1 - e~ aM x ) (10.3) 

A schematic of a photodetector is illustrated in Fig. 10.3. Incident light penetrates the 
semiconductor surface and generates electrical current /. The device structure consists of 
p and n regions separated by wide intrinsic (very lightly doped) region of width w under 
large reverse bias voltage, see Fig. 10.4. 

When a photon is absorbed by the solid (and if it has enough energy, i.e. large enough 
frequency), its energy causes an electron to move from valence band (leaving one hole 
behind) to the conduction band and to create one extra electron there. The change of energy 
of an electron should be (at least) E g , which is known as the energy gap. Photons with 
smaller energies are usually not able to create electron-hole pairs, see Fig. 10.5. Therefore, 
photon energy E p = hv must be 

ch 

E p > Eg or hv >E g or X c = — 
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Semiconductor 


Schematic of a photodetector. 


photon hv 



PIN photodetector in external electrical circuit. 


p-type Intrinsic n-type 



Energy band diagram of a PIN photodiode. Generation of electron-hole pair is shown. 
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| Equivalent circuit model of a PIN photodetector. 


where X c is known as cutoff and it defines a usable spectrum. The practical formula is 
1.24 

X c = -, X c is in pm and E g in eV 

Eg 

The applied high electric field causes electrons and holes to separate. They subsequently 
flow to n (electrons) and p (holes) regions and produce flow of current in the external circuit 
known as photocurrent. 

Different semiconductor materials have different bandgaps and therefore different cutoff 
wavelengths. They are used as detectors in different parts of the electromagnetic spectrum. 
Bandgaps and cutoff wavelengths for some semiconductors and compounds are summarized 
in Table 10.1. 

An equivalent circuit of PD is shown in Fig. 10.6 [1], It consists of a current source, 
parallel junction resistance Rj, which is equivalent to the differential resistance of the diode. 
Rj is typically large, around 10 6 £2, and usually it is ignored in the analysis. C, is the junction 
capacitance; R s is the series resistance due to bulk and contact resistance and typically is a 
few ohms. Rl is the load resistance. In a typical analysis R, and R s are neglected. 

The spectral function V L (v) of the output voltage is related to the spectral function of the 
photocurrent i>(v) as 


V L (v ) = i P (v) 


1 + j2it vR/Cj 


(10.4) 
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Incident light 



Ti/Au metal contacts 
InAlAs 
Graded layer 

InGaAs 

(Photon absorption) 
InP buffer 
Si-InP substrate 

(a) (b) 



Schematic of a metal-semiconductor-metal (MSM) photodetector. Top view of the structure (a) and its cross-section 
view (b). 


The previous equation is obtained by solving KirchofF’s loop equation. Equivalent 
impedance is 

1 1.1 1 . . „ 1 + jcoCjR,. 

Using V[ (v) = ip(v) ■ Z e , Eq. (10.4) follows. Frequency bandwidth is therefore determined 
by the time constant CjRp. One wants to keep this time constant small in order to achieve 
a large bandwidth. 

We finish this section with a brief discussion of an integrated optical receiver using 
a metal-semiconductor-metal (MSM) photodetector, which is shown in Fig. 10.7. The 
MSM detector structure consists of an interdigital pattern of metal fingers deposited on a 
semiconductor substrate and a typical vertical PIN detector [6], Electric potential is applied 
between alternate fingers creating an electric field which sweeps photogenerated electrons 
and holes to the positive and negative electrodes, respectively. 

This structure shows several improvements compared to traditional designs, like a signif¬ 
icant improvement in sensitivity. Most of the improvements result from the lateral design. 

10.2.2 Performance parameters of photodetectors 


Main parameters which determine characteristics of PD are [7]: dark current, spectral 
response, quantum efficiency, noise, detectivity, linearity and dynamic range and speed and 
frequency response. We will now discuss them in more detail. 

Dark current 

Dark current is the current flowing in the PD without incoming light. Typical values for 
popular semiconductors are shown in Table 10.2. 
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Table 10.2 Dark currents for different semiconductors and their compounds. j 

Semiconductor 

Dark current (nA) 

Si 

0.1-1 

Ge 

100 

InGaAsP 

1-10 


Quantum efficiency 

Quantum efficiency is a measure of the efficiency of the generation of the electron-hole 
pairs (EHP). It is defined as [8] 

_ number of free EHP generated _ (10 5) 

number of incident photons Po/hv 

Here I p h is the photocurrent flowing in the external circuit, e is the electron’s charge, Pq is 
the incident optical power and hv is the energy of a single photon. 


Responsivity 

It is also known as spectral responsivity. It is defined as [8] 



( 10 . 6 ) 


Responsivity specifies the photocurrent generated per unit optical power. It can be expressed 
as 


(10.7) 


Ideal responsivity is therefore a linear function of wavelength X, see Fig. 10.8. 


Speed of response 

It determines how PD responds to an optical signal. A typical response to a pulse is shown 
in Fig. 10.9. 

Speed of response is determined by the RC time constant. In terms of parameters of 
previously introduced equivalent circuit, the rise time r,. is given by 

t r = 2A 9 -Rl-Cj 

The evaluation of x r is left as an exercise. Time response is directly related to the frequency 
response. The 3-dB bandwidth is [9] 

1 

2 txR l Cj 


fs-dB = 
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Wavelength (|tm) 


Typical current responsivity versus wavelength for Si. Also shown is the responsivity of an ideal photodiode with 
quantum efficiency rj = 0.9. 



Typical response of a photodetector to a square-pulse signal. 


10.2.3 Photodetector noise 


Typical optical signals arriving at the receiver front end are very weak. They are, therefore, 
significantly affected by various noise sources. 

There are several physical processes which contribute to noise. For example, the APD 
generates noise in the avalanche process and optical amplifiers produce noise due to am¬ 
plified spontaneous emission (ASE). In addition, there is always a quantum noise which 
exists in all devices and sets a fundamental lower limit on noise power. 

The working parameter characterizing the photodetector is the signal-to-noise ratio 
(SNR), which is defined as [10] 

S signal power from photocurrent 


N photodetector noise power + amplifier noise power 


( 10 . 8 ) 
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Noise appears as random fluctuations in a signal. A typical measure of noise is associated 
with the variance or the mean square deviation of the signal s, which is defined as [7] 

of = (s- s f — s 1 - s 2 
The mean value (average) s is defined as 

5 = s 

where p(s) is the probability of the measured signal having a value s and the sum is 
evaluated over all possible values obtained from measuring the signal. 

Noise in a signal s can be represented by random variable s„: 


If in the device (or system) there are two or more simultaneously present noise sources 
s n i,s„2, ■ ■ . which are independent, their combined effect is found by adding their mean 
square values (or their powers): 

s l = s l\ + s \i H - 

In terms of the above quantities, the SNR is expressed as 

SNR = == = = or SNR = 10 log = [ dB ] 

Main types of noise arising in a photodetector will now be summarized. 


Shot noise 

Shot noise is due to random distribution of electrons generated in the photodetector. It is 
associated with the quantum nature of photons arriving at the photodetector which gener¬ 
ates carriers. Photons arrive at the photodetector randomly in time due to their quantum- 
mechanical nature. Their randomness is described by Poisson statistics. The resulting 
expression for shot noise of current in the photodetector is [4], [7] 

I = 2eBT s (10.9) 

where e is the electron’s charge, B is the bandwidth and T s is the average value of signal 
current. 


Thermal noise 


Thermal noise or Johnson noise is due to the random motion of electrons in the resistor R. 
It is modelled as a Gaussian random process with zero mean and autocorrelation function 
given by [4] 


4 k B T 
R 


«(t) 
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Equivalent circuit model of a PIN photodetector with an amplifier. 


-O 


-O 


where 5(r) is the Dirac delta function. Thermal noise power in a bandwidth B is [7] 

p s ,th = 4k B TB (10.10) 

where k B = 1.38 x 10 -23 J/K is the Boltzmann’s constant and T is the absolute temperature. 
Thermal noise can be expressed as a current source [4], [7] 

Here I T is the parameter used to specify standard deviation in units of pA/VHz. Its typical 
value is IpA/VHz [2], 


10.2.4 Detector design 


In this section we perform a simple analysis of a PIN photodetector in the presence of noise 
[11]. Using results 10.6 and 10.7, the signal current of the detector is 


( 10 . 12 ) 


where P is the optical power coming to the detector at wavelength A.. Equivalent circuit of 
a PIN photodetector with an amplifier is shown in Fig. 10.10. It is a simple modification of 
the circuit shown in Fig. i0.6. 

Noise generated in the amplifier is essential for the operation. Amplifier noise is rep¬ 
resented [11] as Johnson noise originating in the load resistor R L . Using Eq. (10.1 1), the 
signal-to-noise ratio is 


S _ z 2 ■ Rl _ i 2 s ■ Rl 


(10.13) 


where B is the bandwidth and T is the resistor’s temperature. The £ ratio cannot increase 
indefinitely by increasing value of Rl because of the effect of junction capacitance C r In 
the typical situation, the bandwidth is determined by (Rl ■ Cj) . Using the expression for 
the bandwidth i? = (2jr • Rl ■ Cj ) 1 (see problem), Eq. (10.13) takes the form 


S i 2 

N ~ Sk B ■ T ■ B 2 ■ Cj 


(10.14) 
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Plot of Pmin versus bandwidth for PIN detector for three values of temperature (1500 K, 1000 K and 500 K). 

Minimum detectable signal 4, min can be determined from the above by setting ^ = 1. One 
finds 

4, min = 2 ■ B ■ (2n ■ k B ■ T ■ Cj) 1/2 (10.15) 

Minimum detectable optical power is determined from Eq. (10.12) as 

P mm = — -B-(2n-k B -T- C,) 1/2 (10.16) 

rjeX. K ’ 

The plot of Emin versus bandwidth B is shown in Fig. 10.11. The following parameters were 
assumed: C, = 1 pF, r) — 1, T — 1000 K, k = 0.85 |xm. MATLAB code is provided in an 
Appendix, Listing 10A.1. 


10.3 Receiver analysis 


Optical detection theory is extensively discussed, e.g. by Einarsson [12], The problem in¬ 
volves detection of signals with noise. In optical systems the information is transmitted 
using light which consists of photons. Due to their statistical nature, the transmitted in¬ 
formation will always show random fluctuations. Those fluctuations determine the lowest 
limit on transmitted power. In addition, there exist other noise contributions which originate 
from various processes. Some of them have already been discussed. 

Digital signal under consideration operates at a bit rate B. The time slot (or bit interval) 
T is 

1 


T = 
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and it is the inverse of the bit rate. The input data sequence in the communication system 
is denoted by {bk}. The optical power pit) falling on the photodetector is the sequence of 
pulses and it is written as 

pit) = J2 b k h p it-k-T) (10.17) 

where k is a parameter denoting the k-th time slot and h p (t) represents the pulse shape of 
an isolated optical pulse at the photodetector input. 

It is assumed that [5] 

^j°°h p (t)dt = 1 (10.18) 

so that b k represents the received optical power in the k-th time slot. 

Eq. (10.17) is based on the assumption that the system consisting of transmitter and 
optical fibre is a linear one and time-invariant, bk in Eq. (10.17) can take two values bo 
and b\, which correspond to logical values in the k-th time slot being ZERO or ONE. In 
an ideal case one would expect the bo to be zero so no optical power is transmitted for the 
logical ZERO. However, semiconductor lasers always operate at the nonzero bias current, 
so there is always some small optical power transmitted. 

10.3.1 BER of an ideal optical receiver 


There are several ways to measure bit error rate (BER), that is the rate of error occurrence 
in a digital data stream. It is equal to number of errors occurring over some time interval 
divided by total number of pulses (both ones and zeros). 

The simplest method to define BER is [13] 


BER = 


N e 

Np 


N e 


(10.19) 


where N e is the number of errors appearing over time interval t, N p is the number of pulses 
transmitted during that interval, B = l/T is the bit rate and T is the bit interval. 

The required BER for high-speed optical communication systems today is typically 
10 -12 , which means that on average one bit error is allowed for every terabit of data 
transmitted. BER depends on the various signal-to-noise ratio (SNR) of the fibre system, 
like the receiver noise level. 

Direct optical detection is a process of determining the presence or absence of light 
during a bit interval. No light is interpreted as logical ZERO; some of the light present 
signals is logical ONE. 

In a real life, the detection process is not so simple because of the random nature of 
photons arriving at the receiver. Their arrival is modelled as a Poisson random process. The 
random process, in time arrivals of photons at the photodetector, is shown in Fig. 10.12. 

For an ideal optical receiver we will assume that there are no noise sources in the system. 
The average number of photons arriving at the photodetector, with hv c being the energy of 
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T 

| Random arrivals of photons at photodetector are described by the Poisson process. Each photon is represented by a box 
and they all have the same amplitude. 


a single photon, is thus 

A t =^1 ( 10 . 20 ) 

hv c 

where pit) is power of light signal, h is Planck’s constant and v c is the carrier frequency. 
The optical power impinging on the photodetector is expressed by Eq. (10.17). 

A simple expression for BER for ideal receiver (no noise) can be obtained as follows 
(following Ramaswani and Sivarajan [2]). 

The probability that n photons are received during a bit interval T is 



where N is the average number of photons given by Eq. (10.20). Probability of not receiving 
any photons (n = 0) is exp (- N/B ). Assume equal probabilities of receiving ZERO and 
ONE. The BER of an ideal receiver is thus 

BER = ^ e~ N/B = X -e~ M (10.21) 

where M = ^ ^ represents the average number of photons received during one bit. 

Expression (10.21) represents BER for an ideal receiver and it is called the quantum 
limit. To get a typical bit rate of 10 -12 , the average number of photons is M = 27 per one 
bit. 


10.3.2 Error probability in the receiver 


Assume that a binary signal current at the photodetector is [4] 

itot(t ) = ■ h p (t — nTg) + Id + inoiseO) 

= iph(t) + Id + inoise (I ) 

where i p h(t) is due to the amplitude modulated pulse signal, Id is the dark current of the 
photodiode and ) is the noise current. B„ takes two values B H and B L corresponding 
to logical ONE and logical ZERO, respectively. 

Assume (after Liu [4]) that //(&>) is the combined transfer function of the front-end 
receiver and equalizer. The output at the equalizer is 

yau.it) = i tot (t ) ® h{t) 


( 10 . 22 ) 
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where <g> denotes convolution and hit) corresponds to the transfer function H{co). The 
signal component at the output of the equalizer is 


y,(f) = iphit) ® hit) = J2 B„ ■ h p (t - nT B ) ® h(t) 
= £ B n • h p {t - nT B ) 


The signal output of the equalizer is sampled at the bit rate and compared with a threshold. 
This process allows for detection of the amplitudes B„. The output signal after equalizer 
without (constant) dark current is 

y out it) = B n- Ppit - nTs) 


where p p (t) contains signal and noise. The above is sampled and the output at time 
nf B + r(0 < r < T B ) is 


youtAO = y<mtinT B + t) = J2 Bk ' Ppd n - k ^ Tri + r ) +y> 

m B n ■ Pp [0] + ISI„ + y no ise, n 


(10.23) 


with the following definitions: 


P P [n\ = PpinT B + t) 


isr n = J2 B k-P P ([n-k]) 

k±n 

and 

JW,» = y n oise,mt(nT B + r) 

is the sampled noise term. ISI„ refers to the intersymbol interference. 

In a practical receiver, the decision as which bit (ZERO or ONE) was transmitted in each 
bit interval is done by sampling current after equalizer. With the presence of noise, there is 
nonzero probability of an error. The errors are typically described by BER (bit error rate). 

Introduce y tB as a threshold used in error detection. From Eq. (10.23), one can observe 
that errors occur when 

B k = B h when y mt ^ n < y th 

or 

B k = B,_ when y rmt , n > y th 

Therefore, the probability of an error detection is [14] 

Perm- = P()P (yout.n > y t h\B k = B L ) + p\P (jW,„ < y th \B k = B H ) 

Here po and p\ are a priori probabilities for bits ZERO and ONE. 
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Determination of bit error rate for a Gaussian process. 


The decision taken involves comparison of the measured photocurrent I with a threshold 
I t h. If I > I,h, one decides that a ONE bit was transmitted; otherwise for / < I th a bit ZERO 
was transmitted. 


10.3.3 BER and Gaussian noise 


In optical digital communication the transmitted signal is never perfectly recovered. Noise, 
adjacent channel interference, amplified spontaneous emission, all result in creation of 
errors. Additionally, pulses are broadened due to fibre dispersion, device nonlinearity or 
slow photodiode response. Created signal distortion is known as an intersymbol interference 
(ISI) and it arises when adjacent bits are corrupted. It results in possible detection errors 
for the transmitted bits. ISI is random because the binary bits in a digital signal are random. 
To minimize ISI, an equalizer is generally used. 

We analyse the performance of real receiver with noise [15]. We want to establish 
connection between the signal-to-noise ratio (SNR) and BER. By referring to Fig. 10.13, 
we assume that V H (high voltage at the output of the receiver) and V L (low voltage at the 
output of the receiver) are random variables described by the Gaussian probability density 
function (pdf) as 

p(x) m (10.24) 

V2jr 

BER is defined in terms of two conditional probabilities P(0 | 1) and P( \ 10) as 

BER = p 0 P( 0| 1) + Pl P{\ |0) (10.25) 


where po and p\ are a priori probabilities for bits ZERO and ONE. Their values are po = 
Pi — \. Conditional probability P(A\B) for two events A and B is defined as 


P{A\B) = 


P(A • B) 

P(B) 
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Plot of bit error rate versus Q. 


The conditional probabilities P(0| 1) and P{\ 10) are shown in Fig. 10.13. Assuming Gaus¬ 
sian pdf, they are determined as 


P(0|1) = isi eAdx 

Qi 

P(1|0) - 

Qi 


where functions Q\ = v, *~ Vl and Q\ = VH a ~ v,h are integration limits. We assume that 
distributions of ZERO and ONE are identical and that noise variance a noise is the same for 
both. It follows that the optimum threshold is \ (V H + V L ). Thus 


Q = Qi 


= 0.2 = 


Vh-Vl 

2cr„oi se 


Using definition (10.25), the BER is evaluated as 


~le 

V2jt J 


The above function relates BER to the signal-to-noise ratio Q. It is plotted in Fig. 10.14. 
For a typical BER of 10 -6 , Q = 6. MATLAB code used to create this figure is shown in 
Appendix, Listing 10A.2. 
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The average incident input power P is defined as 

P=\<,Ph+Pl) (10.26) 

where Ph and Pl are optical powers corresponding to logical ONE and ZERO (high and 
low), respectively. Those powers are related to voltages V H and Vi introduced earlier as 


Vh = R-Ph • Ts,max(f) (10.27) 

Vl — R - P L - Ti,max(t) (10.28) 


where R is the responsivity of the detector andy, imax (t) is the maximum output voltage of 
the receiver during a given bit period. In terms of the above quantities, the average power 
P is evaluated as follows: 


p=\(p„+p,)='/^Pm-p, ) 
_ i i+ i i 1 


1 ijg 

2 ‘ l-r 


(10.29) 


The above expression gives the average incident input power level necessary to obtain a 
given BER. In the above derivation, we have used <y 2 oise = n 2 ollt , where n 2 out is the variance 
of the output noise. We also defined extinction ratio r as 


Pl_ 

Ph 


(10.30) 


For a PIN photodiode, the receiver sensitivity takes the form 


VP = 



Ideal sensitivity is obtained where no power is transmitted (r = 0) as 


(10.31) 


VPm 



(10.32) 


Using the above results, one can analyse receiver sensitivity for various types of noise 
filters. We refer to the literature for a more detailed discussion [15], [14], [4], 


10.4 Modelling of a photoelectric receiver 


A schematic of the receiver is shown in Fig. 10.15. It is modelled by a filter function H (v), 
see Geckeler [16]. As a front end it contains an avalanche photodiode, with M being the 
multiplication factor of the photocurrent and G the gain factor of an electronic amplifier. 
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Sft) 


H(y) 

T 

M 




G 



f Block scheme of the receiver which is described by a filter function H(v) with parameter r. 


Both M and G are assumed to be frequency-independent. Frequency dependence of the 
receiver is described by a low-pass filter function II (v), which we assume to be a real 
function (not complex). Possible phase distortions which will result in function H(y) being 
complex can be compensated by an equalizer. 

In Fig. 10.15 Si (v) is the frequency spectrum of the input, S 2 (v) is the output frequency 
spectrum, s(t) is the temporal output signal and Pi is the average power arriving at the 
photoelectric receiver. Parameter r characterizes filter function II(v). 


10.5 Problems 


1. Determine rise time z r for a simple RC circuit. Relate it to 3 dB bandwidth. 

2. Analyse noise in an avalanche photodiode (APD). Determine signal-to-noise ratio in 
terms of gain of APD. 

3. Analyse signal limitations for an analogue transmission. Determine signal power in 
terms of S/N ratio. 


10.6 Projects 


1. Conduct small-signal equivalent circuit analysis of a photodetector [1], Write a MAT- 
LAB program. Determine frequency dependence of the small signal impedance of a 
photodiode. 

2. Analyse frequency response of a photodiode [17]. Determine the upper limit of the 
frequency response of GaAs p-n junction photodiode assuming data provided in [17]. 


Appendix 10A: MATLAB listings 


In Table 10.3 we provide a list of MATLAB files created for Chapter 10 and a short 
description of each function. 
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Appendix 10A 


Table 10.3 List of MATLAB functions for Chapter . 

Listing 

Function name 

Description 

10A.1 

min_power.m 

Determines minimum signal power 

10A.2 

berQ.m 

Plots bit error rate versus Q 


Listing 10A.1 Function min power.m. MATLAB function which determines minimum 
signal power for a PIN photodetector for three values of temperature. 


°/ 0 File name: min_power.m 

”/, Determines minimum signal power required 
"/„ for a PIN detector 
clear all 

h_Planck = 6.6261d-34; 
c_light=299.80d6; 
k_B= 1.3807d-23; 
e=l.602d-19; 

y. 


) give S/N ratio of c 


1 Planck’s constauit (J s) 
l speed of light (m/s) 

% Boltzmann constant (J/K) 
°/ electron’s charge (C) 


C_j = ld-12; 
eta = 1; 

lambda = 0.85d-6; 
B = ld6:ld5:ldlO; 
1 


% junction capacitance (lpF) 

°/„ quantum efficiency 
7. wavelength (0.85 microns) 
l range of detector bandwidth 


for T_eff = [500 1000 1500] 

P_min= (2*h_Planck*c_light/ (eta*e*lambda)) *sqrt (2*pi*k_B*T_ef f *C_j ) *B; 
loglog(B,P_min,’LineWidth’,1.5) 
hold on 




xlabel(’Detector bandwidth (Hz)’ ,’ FontSize’,14); 
ylabel(’Minimum signal power (W)’FontSize14); 
text(ld9, 4d-7, ’500K’,’Fontsize’,14) 
text(5d7, 4d-7, ’1500K’,’Fontsize’,14) 

set(gca,’FontSize’,14) ; ”/, size of tick marks on both axes 

grid on 

close all 


Listing 10A.2 Program berQ.m. MATLAB program which plots bit error rate versus Q. 
Takes one minute to execute. 


•/. File name: berQ.m 

"/. Plots bit error rate versus Q 

syms x 


°/, defines symbolic variable 
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f=(l/sqrt(2*pi))*exp(-x~2/2) ; °/ 0 defines function for plot 

Q_i = 3.5; Q_f = 8; Q_step = 0.5; °/ 0 defines range and step 

yy = double(int(f,x,Q_i,inf)); 
for Q = Q_i+Q_step:Q_step:Q_f 
y = double(int(f,x,Q,inf)); 

yy = [yy.y]; 

end 

Qplot = Q_i:Q_step:Q_f; °/ 0 defines Q for plot 

semilogy(Qplot,yy,'LineWidth',1.5) 

xlabel( ’ Q’,’FontSize 1 ,14); 

ylabel(’Bit Error Rate’,'FontSize 1 ,14); 

set(gca,'FontSize',14) ; °/ 0 size of tick marks on both axes 

grid on 
pause 
close all 
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Finite difference time domain (FDTD) formulation 




The finite-difference time-domain (FDTD) method is a widely used numerical scheme for 
approximate description of propagation of electromagnetic waves. It can be used to study 
such phenomena as pulse propagation in various media. It is a simple scheme and can be 
mastered relatively fast. 

Definite reference on the subject is a book by Taflove and Hagness [1], Several other 
books can be cited: Kunz and Luebbers [2], Sullivan [3], Yu et al. [4], Elsherbeni and Demir 
[5], Also, recently published books by Bondeson et al. [6] and by Garg [7] have instructive 
chapters on the FDTD method. 


11.1 General formulation 


To develop a general formulation, one starts from Maxwell’s equations which we cast in the 
form suitable for ID, 2D and 3D discretizations. In the following, we assume that J = 0 
and p = 0; i.e there is no flow of current and no free charges. Assume also nonmagnetic 
material and therefore set p = p$. General Maxwell’s equations are 


V 


V 


( 11 . 1 ) 

( 11 . 2 ) 


Here E and H are, respectively, electric and magnetic fields and D is the electric displacement 
flux density which is related to electric field E. The relation between E and D will be 
summarized in later sections for more detailed models. 

First we write the above equations in terms of components. The following cases are 
typically considered: 


1. ID (one-dimensional) case: infinite medium in y-direction and z-direction and no change 

in those directions and therefore ^ ^ = 0. 

2. 2D (two-dimensional) case: infinite medium in z-direction and no change in z-direction 
and therefore fz = 0. 

3. 3D (three-dimensional) case: no restriction on variations in all three directions. 


In this chapter we will not discuss the 3D case in detail since it requires significant 
hardware resources for typical calculations. 
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General formulation 


11.1.1 Three-dimensional formulation 


\ / 3H- a 

) a - v (nr 


Use general mathematical formula 

< H = 

H x H y H z 
(d H z 

~ a *U 

By applying the above expression, Maxwell’s equations can be written as 
dD x _ dH z _ 3H y 
3 1 3 y dz 

3 Dy _ 3 H z dH x 

3 1 dx 3 z 

3 D z _ dH y dH x 
3 1 dx 3 y 

m x _ 3 E z dEy 

^ ° 3 1 dy dz 


/ dH y _ 3 H x \ 
V”37 ~~dy) 


Mo 


dHy 

~di : 

3 H z 


dE z _ dE x 
dx dz 


™ dt dx dy 

These are general equations used as a starting point for 3D implementation, see e.g. [1], 


11.1.2 Two-dimensional formulation 


In the 2D model we assume that = 0 and evaluate 


<H = l Ty 0 

H x H v H z 


3/4 
lx 3 y 


dH z 


/ dHy 


dHA 

dy J 


where n x ,a y , a z are (as before) unit vectors defining coordinate system. Using the above 
expression, Maxwell’s equations in terms of components become 


3/4 
lx 3 y 


dH. 


(d Hy 3/4 \ 

a V dx dy) 


3 D x 

lx ~dT 


3 Dy 


3 D z 

iz ~dT 


d E z d E z (dEy dE x \ ( 3/4 dHy d H-\ 

^ + * z (ar - vj= “ Mo ( a ^ +a ^ir + a -irj 


The above equations are then usually split into two groups, see [1] and [8]. 
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Group 1: 

3D X _ dH z 

3 1 3 y 

3 D y _ 3 E z 

3 1 dx 

3 H z (dE y 3 E x \ 

3 1 ^ 0 \ dx 3 y) 

Group 2: 

3/4 _ 1 3 H z 

3 1 ix o 3y 
3// v _ 1 dE z 
dt (X o dx 
3D- _ 3/^ _ 3/-4 
3f 3x 3y 


(11.4) 

(11.5) 

( 11 . 6 ) 

(11.7) 

( 11 . 8 ) 
(11.9) 


The first group is called TE X mode by Taflove and TE to z polarization by Umashankar, 
and the group 2 is called TM Z mode by Taflove and TM to z polarization by Umashankar. 


11.1.3 One-dimensional model 


In one dimension, the medium extends to infinity in the y-direction and z-direction. There¬ 
fore « | = 0. Using the following result, 


V 



a r 


0 

H z 


3 H z dHy 

" a,: ihr + 


one can write Maxwell’s equations in two groups according to field vector components. 

First group is 


dHy _ 1 3 E z 
dt /x 0 dx 

3 D z _ dHy 

dt dx 


( 11 . 10 ) 

( 11 . 11 ) 


Taflove [1] classifies this group as the x-directed, z-polarized TEM mode. It contains D y 
and H y . 

The second group is 


3 H z _ 1 dE y 

dt IX o dx 


( 11 . 12 ) 


3 Dy _ 3 H z 
dt dx 


(11.13) 


This group is referred to as the x-directed, y-polarized TEM mode. It contains D y and H z . 
In the following, we will concentrate on group 2. The equations describe wave propagating 






amplitude 
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General formulation 



Gaussian pulse (left), and modulated Gaussian pulse (right). 


in the x-direction with electric field oriented in the y-direction and magnetic field oriented 
in the z-direction. 

We finish this section with a discussion of the Gaussian pulse and modulated Gaussian 
pulse. 


11.1.4 Gaussian pulse and modulated Gaussian pulse 


In practice one often analyses propagation of the Gaussian pulse or modulated Gaussian 
pulse. A Gaussian pulse centred at to is defined as 

g(0 = exp J^- ^ j (11.14) 

where w is the width of the pulse in space. Define also the modulation signal as 

m(t) = sin (2 tt/ 0 • t) (11.15) 

where fo = c/A, 0 = l/T 0 . Here T 0 is the period of the modulation signal m(t). A continuous 
Gaussian pulse is created by multiplying Eqs. (1 1.14) and (11.15): 

g(t) = sin (27 t/ 0 • t) • exp (t ~* o) j (11.16) 

Its discretized version is 

g(n) = sin {Info ■ « • At) • exp ^ ^2 ^ j (H-17) 

In Fig. 11.1 we plot Gaussian and modulated Gaussian pulses. The results were obtained 
with MATLAB code presented in Appendix, Listings 1 1A.1 and 1 1A.2. 
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Finite difference time domain (FDTD) formulation 


11.2 One-dimensional Yee implementation without dispersion 


In this section, we discuss the basic implementation of FDTD method in ID. It is known 
as the lD-Yee algorithm since it was first proposed by Yee [9] for the 3D case. We also 
introduce the concept of a staggered grid [3], [7]. To illustrate basic principles of FDTD 
method, let us develop first the ID lossless case. 

11.2.1 Lossless case 


Our formulation is based on the second group of Eqs. (11.12) and (11.13). We supplement 
those equations with the relation between D y and E y , which for the free space is 


The equations take the form 


In order to introduce ID Yee discretization, let us introduce the following notation: 

E y (i ■ Ax, n ■ At) = (Ey)" 

H z (i- Ax,n- At) = (Hy)" 

To discretize Eqs. (1 1.19) and (1 1.20), we use central-difference approximation for space 
and time derivatives. They are second-order accurate in the space and time increments: 

(Ey)" + 1/2 ~ ( Ey)U 


Dy = 

= £0 Ey 

(11.18) 

dH z _ 

1 dEy 

(11.19) 

~di~ ~ 

Ho 9x 

dE y 

1 9H Z 


”aT “ 

£0 dx 

(11.20) 


- (i ■ Ax, n ■ At) = - 


F O(Ax) 2 


—2- (/ • Ax, n ■ At) = - 


In the Yee algorithm [9] E and H components are interleaved in the space lattice at intervals 
of Ax/2. Such an approximation is called the staggered grid approximation. They are also 
interleaved in time at intervals At/2, see Fig. 11.2. 

At time t = 0, values of E y are placed at x = i • Ax, with i = 0,1, 2,... N. (Total N + 1 
components). 

At time t = \ At, values of H z are placed at x = (i — \) • Ax, with i = 1, 2,... N. (Total 
N components). 

In general 


( E y )1 = Ey((i- \) ■ Ax,n- At), i = 1 , 2 ,... N+ 1 ; n = 0 , 1,2 ... 

( H z )1 = H z • Ax, (n - ^ • Af^ , i = 1,2,.. ,N\ n = 0, 1,2... 
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H z h_ H z H z 

— e- -e-- &■ -- 

^ E y /j & | E y ^y jfo 


H. H r H r K 


Ey Ey Ey 

A _I_ L^ x 


= 0 Ax 2 Ax 3 Ax 4Ax 

One-dimensional formulation of the FDTD method. Orientations of H z and E v fields are shown. 



0 ° 

) □ o 

. yt\ 

0 Ey 

> o/4 


1 


r-1 r-j i i~\ i +1 


Visual illustration of numerical dependencies in the ID FDTD method. 


Initial values (corresponding to n = 0), i.e. E(...,t = 0) and H(... ,t = 0), in most cases 
are set to zero. 

The discretized equations are 

(£ y )” +1/2 - (Ey)r l/2 | 1 (^)r+l/2 - ( g *)”-l/2 

A t sq Ax 

(^)r+i/2 ~ (^)”+i/2 = 1 (Ey)^ /2 - {E y )”~l 12 

At (j, o Ax 

The space-time interdependencies of E y and H z fields at different grid points are illustrated 
in Fig. 11.3. Here, squares represent E y field and circles represent H z field. One can observe 
that the value of a field at any point is determined by three previous values: two from two 
neighbours of opposite field from the previous half time step and one from the same field 
at a single previous time step. 

















Finite difference time domain (FDTD) formulation 




Constants £o and /xo appearing in these equations differ by several orders of magnitude. 
Therefore, one usually introduces a new scaled variable defined as [3] 



Scaled equations are 

K +X/2 (i) = E;r Xj 2 (i) - [H"(i + 1/2) — H"(i — 1/2)] 

V®oMo Ax 

//; +I (r+ 1/2) = 7/;(r+ 1/2) - —= ^ \E n + xl2 (i+\) - E n +' /2 (i)] 
*jeoiaq Ax l ^ ' J 

In our simulations, due to stability reasons we set 


1 Ax _ 1 
teoMo At 2 


( 11 . 21 ) 


The value of v = Ax/ At is chosen to be the maximum phase velocity of the wave expected 
in the medium. The above system has been implemented. The MATLAB program is shown 
in Appendix, Listing 1 1A.3. Two versions of MATLAB code have been produced: one of 
which uses regular loop and one that takes advantage of parallel capabilities of MATLAB. 
The reader should analyse speed of operation for both versions. 

The results of propagation of Gaussian pulse using that code are shown in Fig. 11.4. 










One-dimensional Yee implementation without dispersion 


11.2.2 Determination of cell size 


In one dimension the cell size is just Ax. In 2D it is a square and a cube in 3D. The 
fundamental question is how to determine size of basic cell. 

For a detailed discussion we refer to the book by Kunz and Luebbers [2], For us, it is 
enough to notice that cell size should be much less than the smallest wavelength and typical 
rule is to take it as A./10. Such choice should provide accurate results. 

11.2.3 Dispersion and stability 


Here, we discuss some general properties of the wave equation. It was formulated before. 
Its discretized version is (here we use index r instead of i to avoid conflict with a symbol 

i = V=T) 


E’;+ l - 2E” + E"- 1 2 E n r+l - 2E n r + E”_ l 

At 2 ~ C Ax 2 


( 11 . 22 ) 


As usual, n refers to time and r is the space index. Explicitly £j" = E(r ■ Az, n ■ At). The 
dispersion relation for Eq. (11.22) is found by assuming 


Substituting these assumptions into Eq. (11.22) gives 


r -ibte-(r+l) _ 


_ <r At 2 
~ Ax 2 

Dividing the above by exp[z (m ■ n ■ At — k ■ r ■ Ax)]: 

c 2 A t 2 , _ ik 


-2 + e 


A z 2 


The above can be written as a square: 

^ico-M/2 _ e -iw At/2\ 2 


c 2 At 2 
Ax 2 


(e ikA */i 




Finally, using trigonometric relation e la —e = 2i sin a, one obtains formula for numerical 

dispersion: 

1 cAt 1 

sin-o;-Ar = ±-sin -k ■ Ax (11.23) 

2 Ax 2 y J 


The important case is that of linear dispersion, i.e. when co = c - k. Eq. (11.23) then holds 
when Ax = cAt. For such a linear case different frequency components propagate with 
the same speed, just c. For nonlinear dispersion relations, different frequencies of a pulse 
numerically propagate with different velocities and therefore pulse will change shape. 

Introduce parameter 


At 
C Az 


(11.24) 
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In Fig. 11.5 we showed numerical dispersions as described by Eq. (11.23) for several values 
of parameter a. MATLAB code is shown in Appendix, Listing 11A.4. Depending on the 
value of a, one can distinguish the following cases: 

1. a = 1. In this case At = ^.Numerical dispersion is linear and given by co = ±c-k. This 
choice of time step At is called the magic time step [10]. In this case wave propagates 
exactly one cell (or Ax) per time step in both x-directions. 

2. a < 1. In this case A t < One observes numerical dispersion which increases when 
a decreases. 

3. a > 1. In this case At > ^. This inequality implies that numerical front propagates 
faster than physical speed (given by c). In this case, the scheme becomes unstable. 

Because of its importance, we provide next a separate discussion of the stability criterion. 


11.2.4 Stability criterion 


Stability criterion imposes condition on ^ ratio. It is known as Courant-Friedrichs-Levy 
(CFL) condition [11 ]. It explains convergence of various schemes. 

For the ID model, the CFL condition is illustrated in Fig. 11.6. Each row of points 
represents the sampling points in space at a given instant of time (say n). The solution at 
point A is determined using previous values at the indicated points. In the figure we show 
characteristics which correspond to stable (left) and unstable (right) situations. 

The stability criterion summarizes the physical fact that speed of numerical propagation 
should not exceed the physical speed. A summary of stability criteria for various dimensions 
is presented in Table 11.1 [10]. 
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| Table 11.1 Summary ofstability criteria for various dimensions. \ 

Dimensionality 

Criterion 

ID medium 

v At < Ax 

/ , . \-i /2 

2D medium 

/ - . , \-l/2 

3D medium 

V At ^ ( A? + A? + A?) 



Illustration of the CFL condition. For the parameters chosen within the dashed lines, the propagation is stable (left 
figure) and becomes unstable (right figure.) 


11.2.5 One-dimensional model with losses 


When we consider ohmic losses, Maxwell’s Eq. (1 1. 1) is modified as 


V 


(11.25) 


where a is the conductivity. 

The discretization of Eqs. (11.25) and (11.2) with losses is done in a similar way as for 
the lossless case. The resulting equations after scaling are 

c E y yi +l12 - (E y )r l/2 = i m? +1/2 - mu ,2 

At Sr^/so/J-o Ax 

((Ey)" +l/2 + (E y )r l/2 ) (11.26) 

2e r e 0 

and 

1,2 - (^)”+l/2 1 (^)"+! 1/2 - «~ 1/2 

Mo Ax 


At 


(11.27) 

































Finite difference time domain (FDTD) formulation 


boundary 


direction of 
travelling wave 


i = 1/2 i=l 


' boundary point 

Illustration of propagating wave close to the left boundary. 

When selecting the time step given by Eq. (1 1.21), one arrives at the following equations: 



Solving the first equation for£’" +1/2 (r) gives 



Implementation and analysis of the above equations is left as a project. 


11.3 Boundary conditions in ID 


11.3.1 Mur's first-order absorbing boundary conditions (ABC) 


Here, based on intuitive arguments we derive the so-called Mur’s first-order absorbing 
boundary conditions [12], Consider plane wave (j){x,t) travelling left at the left boundary, 
see Fig. 11.7. 

Reflections of this wave will be eliminated if [7] 


<p(x, t) = cf)(x — Ax, t + At) 


(11.29) 


Perform Taylor expansion 
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Velocity of the propagating wave is 

Ax 
° ~ At 

Substitute into Eq. (11.29) and have 

3 (h 3 <b 

<p(x, t ) = 0(x, t) - cAt + A t 

3x 3 1 

or 

3 <f> 1 30 

3x c dt 

It describes ABC for a plane wave incident from the right. 

Finite difference of Eq. (11.30) close to the left boundary is 

01 +1/2 ~ 0g +1/2 _ 1 ~ * 1,2 

Ax c At 

The values at half grid points and half time steps are determined as 


<r ,/2 = \ (4>r'+4>D 

Cl/2 = 2 (^H-l $) 


(11.30) 


(11.31) 


Substitute the above into Eq. (1 1.3 1) and obtain an expression for 0p +l : 

r ° =<p " + ^ + (1L32) 

The formula (11.32) determines the new (updated) field value at the boundary node point 
at x = 0. 

The above expression is known as the first-order Mur’s absorbing boundary conditions 
(ABC). A similar derivation can be performed for right boundary. It is left as an exercise. 


11.3.2 Second-order boundary conditions in 1D 


We will provide a more rigorous discussion of the boundary conditions based on the work 
by Umashankar [8] and Taflove [10]. It allows for the introduction of the second-order 
Mur’s radiation boundary conditions. 

We start with a ID wave equation for any scalar component of the field 


3 2 U _ J_3 2 U_ 

dx 2 C 2 dt 2 


(11.33) 


Introduce operator L defined 


The wave equation can therefore be written as 


LU = 0 
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U 0 U 1/2 A 


U N .i U n . 1/2 U n 


| Illustration of derivation of ABC in one dimension. 


The above operator can be factored as 

LU = L + L~U = 0 

where L + = D x + ] -D, and L~ = D x — ] -D t . 

Suppose that ID computational region extends from 0 to h\ i.e. one has 0 < x < h. 
Engquist and Majda [13] showed that at x — 0 the application of an operator L~ to U 
exactly absorbs a plane wave propagating towards the boundary. A similar relation holds 
for an operator Z+ operating at x = h. 

In light of the above, the PDE that can be numerically implemented as a first-order 
accurate ABC at x = 0 grid boundary is 


L~U = (d x - l -D^ u = 0 


Multiplying the above by an operator A, one finds 

A (A - -A ) U = 0 


- = 0 


(11.34) 


In the full form 

d 2 U _ 1 d 2 U 
dtdx C 3 1 2 

which is a second-order ABC at x = 0. For the above equation original numerical schemes 
have been proposed by Mur [12], Here, we followed an improved scheme as described by 
Taflove [10]. 

Let U represent a Cartesian component of E or H located on the Yee grid, see Fig. 11.8. 
We use central differences expansion around an auxiliary grid point x = 1/2 and time 
t„ = n- At. The derivatives are approximated as follows: 

3 2 C71" _ 1 I dU |" +1 dU |" _1 

dtdx L» 2 At 


i I" du r +1 _ bu r _1 "| 

2At dx |,/ 2 dx | 1/2 J 

= i wr-vr 

2At [ Ax Ax | 


3 2 U I 

3 1 2 I, 


-Ip 


3 2 U I 

L 


3 2 C7 I 

' ~w\ 


,+i - 2 +c/” -1 A w+l - 2u "+ur 


At 2 


At 2 
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Substitution of the above results into Eq. (1 1.34) and solving for Uq gives 

ut l = K 


cA ' - (CT + UT') + {U? + US) (11.35) 


cAt + Ax v 1 0 ' cAt + Ax K 

That expression determines the value of the field U {x, t ) at the next time t n+ \ = (n + 1) • At 
forx = 0. 

In exactly the same way one obtains the updated equation at the right boundary. The 
ABC wave equation is 

d 2 U 1 d 2 U 
dtd^c + c~dt 2 

The derivatives are approximated as follows: 


- = 0 


a 2 c/1" _ 1 I" dU |" +1 _ ac/r -1 "I 

dtdx \ N _x/2 2A t dx dx |j V _ 1/2 J 

= 1 f u n N +x - u; + _\ u n N - x - u n N- 1 

2A t I Ax Ax 


(11.36) 


r = i| 

r d 2 u\ 

n d 2 U \" 

1 

IV—1/2 2 


v + ^L 

J 

_ 1 

\ur 

- 2U$ + U^- ] 

1 u**\ - 2f/^_! + u": 

2 | 

{ 

At 2 

At 2 


Substituting the last result into Eq. (11.36) and solving for f/ 0 " +1 gives 


m +l = u"~_ 




(t# + E#_,) (11.37) 


11.4 Two-dimensional Yee implementation without dispersion 


We assume medium with the constitutive relation of the form 
D y = soe r E v 

Using the above constitutive relation, from the group 2 (Eqs. 1 1.8) (also called TM Z mode) 
of Maxwell’s equations one has (e = sos r ) 

9 E z _ 1 /dH y 3 H x \ 

dt S \ dx dy J 
a h x _ i m z 

dt fJL 0 dy 

a H y _ i a e z 

dt fXo dx 
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j+i 

j+l/2 


Ay 


j-1/2 ; 



i-1 i-1/2 i i+1/2 


Yee mesh in 2D for the TM : mode. 


For 2D discretization we introduce the Yee mesh, shown in Fig. 11.9. A Yee cell is formed 
around point p, j). The electric field distribution within each cell is assumed to be constant 
and the z-component E z of electric field is defined at the midpoint of each cell. We also 
introduce p, j ) = (7 Ax, j Ay) and F n (i, j ) = F(iAx, jAy, nAt). Maxwell’s equations are 
approximated as 

Trn+l/lf. , , n _ Tjn+ 1/2 ,, 


E z (i, j) — E" p, j) 
At 


i /C c i +1/2, j) - H*r ' a - 1/2, j) 

s 0 e r (i,j) Ax 

1 f/; +1/2 (i, j + 1/2) — H? +l/2 O', j - 1 /2) 

so£r(i, j) Ay 

j + 1/2) - ffT l/2 (z, 7 + 1/2) _ 1 £/(//+ 1)-£”Q,./) 


(11.38) 


(11.39) 


H y n+i/z Q + 1/2, /) - 0 + 1/2, j) _ 1 gQ + 1 , j) - E” Q, j) 

At (Xo Ax 

From the above equations one determines the time-stepping numerical algorithm for the 
interior region. For the same reason as in the ID case, we need to perform scaling. As 
before we introduce new scaled variable E z as follows: 


An algorithm for scaled variables is 


£ z " +l 0, j) = E"(i,j) + 


1 f Hy +l/2 (i + 1/2, j) - H" +1/1 (i - 1/2, j) 


soErdJ) V £ oMo I 


H" +1,2 (i, j + 1/2) - Hr ' (i, j — 1/2) I 
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Absorbing boundary conditions (ABC) in 2D 



; | Propagation of a Gaussian pulse initiated in the middle and travelling outwards. Four values of time are shown. 


H” + y\i, j + 1/2) = j + 1/2) - ^ —= [E”(i, 7+1)- E”(i, j)] 

(11.43) 

//; +,/2 (f+1/2,7) = /T- l/2 (r + 1/2,7) - T ' [^’(f+ 1,7)-^(U)] 

' A_y +e 0 +o 

(11.44) 

The above equations were implemented in MATLAB for the propagation of Gaussian 
pulse. A MATLAB program implementing 2D propagation of the Gaussian pulse is shown 
in Appendix, Listing 11 A.6. 

Propagation of the Gaussian pulse initiated in the middle and travelling outwards is 
shown in Fig. 11 . 10. We illustrate pulse shape at four values of time. One can then observe 
its spread in time. 

11.5 Absorbing boundary conditions (ABC) in 2D 


Absorbing boundary conditions (ABC) in 2D are derived here. We follow the methodology 
based on the wave equation used in the ID case [7] and [10]. The two-dimensional wave 
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Rectangular region used to illustrate derivations of ABC. 


equation for scalar component U is 

1 d 2 U d 2 U d 2 U 
c 2 dt 2 dx 2 3 y 2 

In a rectangular coordinate system, the plane-wave time-harmonic solution to wave equation 
is 

U(x,y, t ) = 


Substituting into a wave equation gives the condition 


]£ + ]? = %-=]?■ 
c 2 

Solution for k x is therefore 


k x = 



The last result can be written 


kk x = i* 2 


(11.45) 


From the above relation, we can reconstruct partial differential equations that can be nu¬ 
merically implemented as a second-order accurate ABC. First observe that for a rectangular 
region shown in Fig. 11.11 near x = 0 boundary, the left travelling wave is described by the 
following equation: 


1 3 2 17 _ 1 3 2 U 1 3 2 U 
c 3x3 1 c 2 dt 2 2 3 y 2 


(11.46) 


Analogously, at the x = h boundary 


1 3 2 C7 _ _ 1 3 2 U 1 3 2 U 
c 3x3 1 c 2 dt 2 2 dy 2 
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Absorbing boundary conditions (ABC) in 2D 


y = Q+l)Ay 

't/ojH 

• C/ij+i 




y=]Ay 

■t/ojX 

• u u 

v = (j-l)Av 

t/oj-i 

U xi „ x 


Numerical boundary conditions. 

To obtain numerical boundary conditions, we approximate the above derivatives around the 
central point j), see Fig. 11.12. 

One obtains 

= i 9L T a ',1 

3x3 1 2A t \ dx 3x [ 






2AtAx 2 At Ax 

The second-order derivative with respect to time t is implemented as an average of time 
derivatives at the adjacement points (0, j) and (1, j) as 
3 2 t/" 2J 1 J d 2 U£j 3 2 U"j 

3 1 2 = 2 | 3 1 2 + 3 1 2 

C/n"! 1 - 2£/»+ C/"; 1 U?+' - 2U "■ 


2 At 2 

Also, as an average of the y derivatives we implement 
PUfaj _ if d 2 U?j | 

dy 2 2 f dy 2 3y 2 | 

USj + 1-2US. 


2A t 2 




U"j+i ~ + U"j- 

2 Ay 2 


2 Ay 2 

Substitute the above results into Eq. (11.46) and have 

Ki ' - u of u "f - u lt l 

2 At Ax 2 At Ax 

= (u o n ? - 2 Kj + - 2U"j + C/fJ 1 ) (11.47) 

- 2C^y + C/ 0 ”y-i + 2C/,T + C^_,) 

From the above we can find the time-stepping algorithm for the component of U along 
x = 0 boundary, i.e. £/"+'. 
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11.6 Dispersion 


Dispersion exists when medium properties are frequency-dependent. Examples include 
dielectric constant and/or conductivity which can vary with frequency. Generally, the fol¬ 
lowing situations exist [1]: 

• linear dispersion 

• nonlinearity 

• nonlinear dispersion 

• gain. 

Here, we will only summarize the most important models of material dispersion and 
briefly outline one of the possible numerical approaches to dispersion. 

11.6.1 Material dispersion 


Usually, the dependencies are established in frequency domain. The defining equation is 
D(oj) = eoE r (oj)E(a>) 

The physics is contained in e r (co). We assume non-conducting media. Main cases are: 

• Debye medium (first order) 


S(a>) =Eoo + 


e s ~ Soo 
1 + jojto 


• Lorentz medium 


e r (w) = s r + 


•+^(sHs) 1 2 


(11.48) 


Specific analytical and numerical work with the inclusion of dispersion is left as a project. 


11.7 Problems 


1. Derive an expression for the first-order Mur’s boundary conditions at the right 
interface. 

2. Determine the residual reflection produced by Mur’s first-order ABC. Assume the 
existence of the following wave at the boundary: 

0(x, t) = exp ( icot ) • [exp (ikx) + R exp (—ikx)} (11.49) 


Analyse reflection coefficient R. 







Appendix 11A 


j Table 11.2 List of MATLAB functions for Chapter . j 

Listing 

Function name 

Description 

11A.1 

gauss.m 

Plots Gaussian pulse 

11A.2 

gauss jnodul.m 

Plots modulated Gaussian pulse 

11A.3 

fdtd free.m 

Propagation of Gaussian pulse in a free space 

11A.4 

num disp.m 

Illustrates numerical dispersion 

11A.5 

gmf.m 

Animates Gaussian pulse in a free space (not used in text) 

11A.6 

fdtdlD.m 

Propagation of Gaussian pulse in 2D 


11.8 Projects 


1. Implement a ID FDTD approach with losses. Analyse propagation of Gaussian and 
super-Gaussian pulses. 

2. Implement boundary conditions for ID and 2D problems. 

3. Use a convolution integral approach which relates D and E: 

D(t) = eooSoEit) + e 0 J E(t - t')x(t')dt' (11.50) 

to develop discretization scheme for the first-order Debye medium as described by 
Eq. (11.48). Implement in MATLAB the resulting numerical scheme. For more details, 
consult the book by Kunz and Lubbers [2], 

4. Conduct similar analysis for the first-order Drude dispersion; i.e. 

e(co) = l + — ^-- (11.51) 

CO (jv c - co) 

where co p is the plasma frequency and v c is the collision frequency. 


Appendix 11 A: MATLAB listings 


In Table 11.2 we provide a list of MATLAB files created for Chapter 11 and a short 
description of each function. 

Listing 11A.1 Program gauss.m. MATLAB program which creates Gaussian pulse. 

•/. File name: gauss.m 
l Plots Gaussian pulse 
clear all; 

7. 

t_zero = 20.0; 


7. center of incident pulse 









282 


Finite difference time domain (FDTD) formulation 


width = 6.0; X width of the incident pulse 

X 

N = 300.0; 

time = linspace(0,50,N); 

pulse = exp(-0.5*((t_zero - time)/width) . ~2) ; X Gaussian pulse 

plot(time,pulse,’LineWidth’,1.5) 
xlabel(’time’,’FontSize’,14); 
ylabel(’amplitude’,’FontSize’,14); 

set(gca, ’FontSize’ ,14) ; X size of tick marks on both axes 

pause 

close all 


Listing 11A.2 Program gauss_modul.m. MATLAB program which creates a modulated 
Gaussian pulse. 


°/ 0 File name: gauss_modul.m 
X Plots modulated Gaussian pulse 
alpha = 0.6; 

mu_0=4*pi*le-7; e_0=8.854e-12 
c=l/sqrt(mu_0*e_0); 
dx = 4.2e-2; 
dt=alpha*dx/c; 
f_mod = 2e9; 
t_0 = 3e-09; 

T = t_0/3; 

N_x = 400; 
spec = [0:N_x]; 
for n = 1:100 

y(n) = exp(-(((n-l)*dt-t_0)/T). ~2) 


”/, fundamental constants 
•/. velocity of light 
X step in space 
”/, time step 

l modulating frequency (2 GHz) 

X peak position of Gaussian pulse : 
X pulse width 


is (2*pi*f _mod* ( (n-1) *dt-t_0) ) ; 


plot(y,’LineWidth’,1.5) 

xlabel( ’ time’,’FontSize’,14); ylabel(’amplitude’,’FontSize ’ ,14); 
set(gca,’FontSize’, 14) ; °/ 0 size of tick marks on both axes 

pause 
close all 


Listing 11A.3 Function fdtd_free.m MATLAB program which describes propagation of 
a Gaussian pulse in a free space using the FDTD method. 

°/« File name: fdtd_free.m 

% Plots E and H components of Gaussian pulse in a free space 
% after some number of time steps using FDTD method 
X No animation 
clear all; 

N_x = 200; X number of x-cells 



283 


Appendix 11A 


% number of time steps 

% Initialize fields to zero at all points 


% center of incident pulse 
% width of the incident pulse 


N_time = 100; 
time = 0.0; 

E_y = zeros(N_x,1); 

H_z = zeros(N_x,1); 
t_zero = 40.0; 
width = 12; 
tic; % start timer 
for istep = l:N_time 
time = time + 1; 

pulse = exp(-0.5*((t_zero - time)/width)~2); °/„ Gaussian pulse 
for i = 2:(N_x) 

E_y(120) = pulse; °/ 0 location of initial pulse 
E_y(i) = E_y(i) - 0.5*(H_z(i) - H_z(i-1)); 

end 


% time loop 


l 

for i = 1:(N_x -1) 

H_z(i) = H_z(i) - 0.5*(E_y(i+l) - E_y(i)); 


toe ’/ measure elapsed time since last call to tic 

subplot(2,1,1); plot(E_y) 

ylabel(’E_y’); xlabel(’FDTD cells’) 

axis([0 200 -1 1]) 

subplot(2,1,2); plot(H_z) 

ylabel(’H_z’); xlabel(’FDTD cells’) 

axis([0 200 -1 1]) 


close all 


Listing 11A.4 Function num_disp.m. MATLAB program used to produce Fig. 11.5 to 
illustrate numerical dispersion. 

% File name: num_disp.m 
% Illustrates numerical dispersion 

N_max = 10; % number of points for plot 

x = linspace(0,pi,N_max); 
hold on 

for alpha = [1.0 0.8 0.2] 

y = 2*asin(alpha*sin(x/2)); 
plot(x,y,’LineWidth’,1.5) 

end 

xlabel(’k \Delta z’,’FontSize’,14) 
ylabel(’\omega \Delta x /c’,’FontSize’,14) 

set(gca,’FontSize’,14); % size of tick marks on both axes 
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text(2.5, 3, ’\alpha = 1’,’Fontsize’,12) 
text(2.5, 2, ’\alpha = 0.8’,’Fontsize’,12) 
text(2.5, 0.5, ’\alpha = 0.2’,’Fontsize’,12) 
pause 
close all 


Listing 11A.5 Function gmf.m. MATLAB program which generates and animates prop¬ 
agation of a Gaussian pulse in a free space. 


•/. File name: gmf.m 

*/, gauss, modulated, free space 

7, Animation of propagation of Gaussir 

7, without boundary conditions 

7, Program not used in the main text 

clear all 


alpha = 0.5; 
mu0=4*pi*le-7; 
e0=8.854e-12; 
c=l/sqrt(muO*eO): 
dx = 4.2e-2; 
dt=alpha*dx/c; 
f_mod = le9; 
lambda = c/f_mod: 
x_end = 400; 
t_0 = 3e-09; 
dxl = dx/lambda; 
index= [0: x_end] ; 
e=zeros(length(index),1); h=e 
ha = plot(e); 
set(ha,’EraseMode’,’xor’) 

7. 

t_end = 500; 
for n = l:t_end 
for i=l:400 


7. alpha = c*dt/dx 

7. magnetic permeability 

7. electric permittivity 

7, velocity of light in a vacuum 

7. space step 

7. time step 

7. modulating frequency 
7. wavelength 

7. peak position of Gaussian pulse in time 


el=e; h0=e; 

7. get the handle to the plot 

7. for simulation of wave propagation 


7, number of time steps 


7. simulate for h-nodes 


h(i) = h(i) - (e(i+l)-e(i))*dt/dx/mu0; 


for i=2:400 7. simulate for e-nodes 

e(i) = e(i) - (h(i)-h(i-l))*dt/dx/e0; 

7, launch Gaussian or modulated Gaussian pulse 
T = t_0/3; 7. specify pulse width 

y(n) = exp(-(((n-l)*dt-t_0)/T),"2).*cos(2*pi*f_mod*((n-l)*dt-t_0)); 

7. y(n) = exp(-(((n-l)*dt-t_0)/T) . ~2) ; 

e(l)= y(n); 7. Gauss pulse starts here 

drawnow; 7.refresh data and plot the current condition of the pulse 
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set(ha,’YData’,(e)); 

end 
pause 
close all 

Listing 11A.6 Function fdtd 2D.m. MATLAB program which describes propagation of 
a Gaussian pulse in 2D. 

% File name: fdtd_2D.m 

7. Program for finite-difference time-domain method in 2D 


7. Propagation of Gaussi 
clear all; 

" pul " 

N_x = 60; 

N_y = 60; 

N.steps = 60; 

7« 

7. number of steps in x-direction 

7. number of steps in y-direction 
l number of time steps 

N_x_middle = N_x/2.0; 
N_y_middle = N_y/2.0; 

7. 

l middle point; location of pulse 

1 

delta.x = 0.01; 
delta.t = delta_x/6e8; 
epsilon = 8.8e-12; 

l cell size 

7. Initialization 

E_z = zeros(N.x,N.y); 
H_x = zeros(N.x,N.y); 
H_y = zeros(N.x,N.y); 

l Initialize field to zero at all points 

t.zero = 20.0; 
width = 6.0; 
time = 0.0; 

7. center of incident pulse 

7. width of the incident pulse 

for istep = l:N_steps 
time = time + 1; 

•/. »ai» loop 

l calculate E_z field 


for i = 2:N_x 

for j = 2: N.y 



E_z(i,j)=E_z(i,j)+0.5*(H_y(i,j)-H_y(i-1,j)-H_x(i,j)+H_x(i,j-1)) 

l Gaussian pulse in the middle 

pulse = exp(-0.5* ((t_zero - time)/width) ~2); 7, Gaussian pulse 
E_z(N_x_middle,N_y.middle) = pulse; 

7 . 

% Calculate H_x field 
for i = 1:(N_x-1) 

for j = 1:(N_y-1) 
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H_x(i,j) = H_x(i,j) - 0.5*(E_z(i,j+1) - E_z(i,j)); 

% 

I Calculate H_y field 
for i = 1:(N_x-1) 

for j = 1:(N_y-1) 

H_y(i,j) = H_y(i,j) + 0.5*(E_z(i+l,j) - E_z(i,j)); 

end 

% -Plotting- 

[x,y] = meshgridCO:1:N_x-l,0:1:N_y-1); 
mesh(x,y,E_z); 

xlabel(’\bfx’,’FontSize’,14); 
ylabel(’\bfy’,’FontSize’,14); 
zlabel(’\bfE_z’,’FontSize’,14); 

set(gca,’FontSize’, 14); 7, size of tick marks on both axes 

axis([0 60 0 60 0 1]) 

text(50,40,0.5, ’time = 60’, ’Fontsize’, 14) 

pause 

close all 
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Beam propagation method (BPM) 


L 


In this chapter we summarize the beam propagation method (BPM). The method is widely 
used for the numerical solution of the Helmholtz equation and also for the numerical 
solution to the nonlinear Schroedinger equation (to be discussed in Chapter 15 dealing 
with solitons). It is the most powerful technique for studying the propagation of light in 
integrated optics. The method was originally introduced by Feit and Fleck in the late 1970s 
[1], The BPM was initially based on FFT algorithm. Later on it has been extended to 
finite-difference based BPM schemes (FD-BPM) and finite-element BPM (FE-BPM) and 
many others. The main characteristics [2] of FD-BPM are the ability to simulate structures 
with large index discontinuity, less memory and time consumption in modelling complex 
structures, the possibility to incorporate wide-angle and full vector algorithms and the 
ability to incorporate transparent boundary conditions. 

There is a large number of algorithms available in the literature and almost all of them 
are based on the concept of a propagator. Propagators are mathematical ‘objects’ which 
propagate fields from one space coordinate to another. An example of the system where 
propagation takes place, known as an optical waveguide, is shown in Fig. 12.1. It splits 
input optical signal into two arms, see also Fig. 12.2. The role of BPM is to determine field 
profile along the waveguide knowing the distribution of refractive index over the whole 
waveguide. 

There is a huge amount of literature on the principles of BPM and its applications in 
integrated photonics. We recommend books by Kawano and Kitoh [3], Pollock and Lipson 
[4], Okamoto [5] and Lifante [6], We start our discussion by illustrating the principles of 
BPM in the paraxial approximation. 


12.1 Paraxial formulation 


12.1.1 Introduction 


We start this section by outlining the simplest version of BPM, where one assumes scalar 
electric field and paraxial approximations which restrict its applicability to the fields prop¬ 
agating at small angles with respect to the axis of the waveguide (guiding axis), which 
we define as the z axis. The geometry of the waveguide is determined by refractive index 
n(x,y,z). The method consists of the repeated propagation of the electric field from a 
perpendicular plane at a given position along the waveguide to the next parallel plane. To 






Paraxial formulation 




Schematical illustration of a typical problem solved by BPM. Knowing the input electric field on the left, determine 
fields at the output (shown by dotted lines on the right). 


illustrate the method, one starts from the wave equation for a monochromatic wave: 


d 2 E 3 2 E 
dx 2 3 y 2 


3 2 E 
3 z 2 


+ k 2 (x,y,z)E = 0 


( 12 . 1 ) 


Here, the spatially dependent wavenumber is given by k(x,y,z) m k 0 n(x,y, z) and k () = 
2n /X is the wavenumber in free space. The main assumption is that the phase variation due 
to propagation along the z axis represents the fastest variation in the field E. This variation 
is exemplified by introducing slowly varying field u and expressing E as 


E(x,y, z) =u(x,y,z)e~ jf>z 


( 12 . 2 ) 


where fi is a constant that represents the characteristic propagation wave vector, fi = n 0 co/c. 
Here no is a reference refractive index which can be, for example, the refractive index of 
the substrate or cladding, ft represents the average phase variation of the field E and it is 
known as propagation constant. Substituting Eq. (12.2) into Eq. (12.1) yields the equation 
equivalent to the exact Helmholtz for the slowly varying field 


3 2 u 3m 3 2 u 

_ 3? + ‘dz ~ 3* 


3 2 m 

3y 2 


+ (k 2 - p 2 )u 


(12.3) 


At this stage one assumes the slow variation of optical field in the propagation direction 
(SVEA or slowly varying envelope approximation), which requires 


I 3 2 m I 

I 9 z2 I 


(12.4) 
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This approximation allows us to ignore the first term on the left hand side of Eq. (12.3) 
with respect to the second one. It is also known as Fresnel approximation. Eq. (12.3) thus 
reduces to 


2 W 


3 u 
3z 


3 2 u. 

3? 


d*u 

3/ 


+ (k 2 - P 2 )u 


(12.5) 


which is known as a Fresnel or paraxial equation. It is a starting point for the description 
of the evolution of electric (or magnetic) field in inhomogeneous medium, for example an 
optical waveguide. It does not describe polarization effects. It determines the evolution of 
the field for z > 0 given an input u(x, y,z = 0). 

The above equation can be expressed as 


2jfi — = Du + Wu 
dz 

with the operators on the right hand side being 


D_—(— — 

2jfi \ dx 2 3 y 2 




( 12 . 6 ) 


(12.7) 

( 12 . 8 ) 


Operator D represents free-space propagation (diffraction) and operator W describes guid¬ 
ing effects. 

Assuming that operators are z-independent, the solution is symbolically written as 

u(x, y, z + Az) = e (S+ ^ )Az u(x, y, z) 


Use the Baker-Hausdorf lemma [7] 

eV = e^ +S e^ ] 


given that A and B each commutes with [A, 5], Using it, we can approximate 
u(x, y,z+ Az) w e SAz e WAz u(x, y, z) 

From the above, it follows that actions due to both operators can be considered indepen¬ 
dently. 

The action of operator D is better understood in the spectral domain. It will be de¬ 
scribed in the next section. The second operator W describes the effect of propagation 
in the presence of medium inhomogeneities and its action is incorporated in the spatial 
domain. 


12.1.2 Operators D and W 


Consider an equation containing only operator D: 

du(x,y,z) 


2 JP- 


_ ( 3 2 3 2 \ 

“ \ 3 ? + df) 


u(x,y,z ) 


(12.9) 
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Define 2D continuous Fourier transform as 

u(k x , k y , z) = J u(x, y, z)e- j(k ‘ x+k > y) dxdy = F x<y {u(x,y, z)} 

and its inverse 

u(x, y,z) = J u(k x , ky, z)e J<k * x+k > y) dk x dk y F x -' [u(x, y, z)} 

Apply the 2D FT to Eq. (12.9) and have for Fourier components 

2j/3 £-u(x, yz) = -(k 2 x + kl)d(x, y, z) 
dz y 

Integrate the above from z to z + Az and have 

u(x, y,z + Az) = e~Vf (k ^ )Az u(x, y, z) 

= H(k x , ky, A z)u(x,y,z) (12.10) 

To analyse operator W, introduce the relation 

fi = hn eff (12.11) 

Write also refractive index n as 

Ti = n eff + An (12.12) 

Substituting the above into Eq. (12.8) gives an expression for operator W to the first order 
in A n 

W = -jk 0 An (12.13) 

12.1.3 The implementation using the Fourier transform split-step method 


A general scheme of FD-BPM is described by the propagator U, which we use to advance 
field as 

E ? (z+ Az) = U (Az)E,(z) 

Propagator U can take many forms depending on the chosen BPM technique. Its operation 
is illustrated in Fig. 12.3. In practice two versions of BPM are used: 1 + 1 FD-BPM and 
1+2 FD-BPM. The nomenclature refers to one dimension along the propagation direction 
(usually the z-axis) and one or two perpendicular dimensions (along x-axis (ID case) or 
in x-y plane (2D case)). Finite difference discretization in the 1 + 1 and 1+2 cases is 
illustrated in Fig. 12.4. 

We illustrate the application of Fourier transform split-step BPM for the propagation of 
a two-dimensional Gaussian pulse. The guiding structure is divided into a large number of 
segments. During each segment of length Az, the optical pulse is propagated as shown in 
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another plane along the z-axis. 


-O—O—O—X—X- 1 + 1 FD-BPM 

2-1 2 2+1 



Illustration of finite-difference discretization for ID and 2D BPM. 


the flow diagram, see Fig. 12.5. The accompanying MATLAB script is shown in Appendix, 
Listing 12B.1. The results of a 3D view of pulse before propagation and after propagation 
are shown in Fig. 12.6. One can observe spread of pulse width as expected. 


12.2 General theory 


12.2.1 Introduction 


Polarization effects are not included in the previous version of BPM. When one wants to 
describe polarization effects, the vector wave equation must replace the scalar Helmholtz 
equation used earlier. In this section we will outline a general approach to systematically 
handle those effects. General formulation is based on an approach established by W. P. 
Huang’s group [8], [9]. As usual, one starts from Maxwell’s equations in the frequency 
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No 


Flow diagram for the BPM split-step method, 
domain: 


V x E = — 7 <u/r, 0 H 

(12.14) 

V x H = j(os 0 n 2 E 

(12.15) 

V ■ h 2 E = 0 

(12.16) 


where we have introduced refractive index n as s = son 2 . We separate all quantities into 
longitudinal (along z-axis) and transversal components, as follows: 

V = V, +z—, E = E; +zE Zf H = H, +7H- 
dz 

v ' = [s'<H- e '=^ 01 ' 



where 
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0.8 




are the transversal components and? = [0, 0,1] is the unit vector along z-axis. Eq. (12.16) 
can be rearranged as follows: 


V • « 2 E = 0 
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or 


or 


The z-derivative is 


dE z 1 dn 2 1 , 

— = — ~E Z - T V f • n 2 E, 

« 2 ?; 2 


(12.17) 


If the refractive index varies slowly along z-axis, we can set ^ 
[9]) and obtain 


dE z 


0 (z-invariant structures 


(12.18) 


The above relation is exact for z-invariant systems, i.e. when there is no change of refractive 
index n 2 along z-axis. 

The wave equation is derived by taking V x • • • operation on Eq. (12.14). One finds 


V x V x E = -jeon 0 V x H = co 2 /j, 0 s 0 n 2 E, (12.19) 


In the last step we have used Eq. (12.15). Next, we use the following general relation which 
holds for arbitrary vector 

V x V x E = V (V • E) - V 2 E 
Applying it to expression (12.19), we obtain wave equation 

-V 2 E + V (V • E) = n 2 ^E (12.20) 

where qj 2 [Ao£q = k^. 

Transversal components of the wave equation (12.20) are 

-V 2 E x + — (V ■ E) = n 2 £E x (12.21) 

dx 

-V 2 £ v + — (V • E) = n 2 klE y (12.22) 

dy 

where V 2 E X = + ^ £ . We will analyse the behaviour of E- component. In the 

above equations, term V ■ E is expanded as 

„ dE x dE y dE z 
dx dy dz 



296 


Beam propagation method (BPM) 


and the last term is replaced using relation (12.18). One finds 
dE x dE y 1 8 , 19 

V ' E = - ~2^~ nE x - =2 Ey 

ox ay n ox n oy 

Substituting (12.23) into (12.21), one obtains 

9 1 9 d 2 E x d 2 E x did. d% .. 

Tx¥Tx nEx+ W + ~M + *x¥Ty n E *-drty +nk Z Ex = ° 

Similarly, from (12.22) we have 

9 1 9 o d 2 E x d 2 E v d 2 E y 9 19 _. „ 

YyWTx” Ex ~ Wx + + ^ + TyWdy" y + n y = 0 

The above equations in a matrix form are 


dx n 2 

Tx " 

+ 9/ + 9? + ”^ 

dx n 2 

dy n ‘ 

dxdy 

9 1 

9 _ 2 9 2 

9 1 

9_ ^ 

, d 2 

-dyn 2 

dx 

dydx 

dyn 2 

dy" 

+ dx 2 + 


nX 

Those equations form the starting point for detailed approximations. 

12.2.2 Slowly varying envelope approximation (SVEA) 

Assume that the transversal field components E x and E v are of the form 
E x (x,y,z) = u x (.x,y,z)e~ jfSz 
E y (x,y, z) = u y {x,y,z)e-j pz 

SVEA requires that 


ia- 


dz 2 


dz 


= 0 
(12.26) 


(12.27) 

(12.28) 


(12.29) 


Applying SVEA to term ^ where (i = x, y), one obtains 

a 2 a 2 r aV g M; 1 


I dz 2 


\ 9z J 

Using SVEA in Eqs. (12.24) and (12.25) results in the paraxial wave equations [9] 

du x 

J-^~ = 
dz 


+ A X yUy 

(12.30) 

+ AyyUy 

(12.31) 






General theory 


where the differential operators are defined by [9] 


1 

1 3 1 3 2 8? 3? , 2 , 2 y\ 1 

I wi'- t v t s +w "' l r 

(12.32) 

XXUX ~ 20 ' 


1 J 3 t 3 2 d y \ 

(12.33) 

A xy 

Uy 2/3 1 dx n 2 3 y Hy 3x3 y j Uy 


1 f 3 1 3 2 d x d y 1 

(12.34) 

A yx U x ■ 

2/1 | dy n 1 3x” x dydx 3x 2 j U * 


f 3v a y 3 1 3 2 / 2,2 2\ 1 

[d^ + d? + Yy¥Vy ny + {nk ^~ P) \ Uy 

(12.35) 

The above can be written in j 

i matrix form: 



ii 

V 

(12.36) 


The evolution of electric field described by Eq. (12.36) is known as vectorial BPM [9]. It 
takes into account the polarization (A xx / A }y ) and includes coupling between E x and E y 


12.2.3 Semi-vector BPM 


Often the coupling between two polarizations is weak and may be neglected. The two 
polarizations are decoupled as long as they are not synchronized with some mechanism 
within the device. The resulting equations which describe semi-vector formulation are 



Under the semi-vector approximation, the polarization dependencies of the EM waves are 
taken into account. 


12.2.4 Scalar formulation 


If the structure is weakly guiding and/or the polarization dependence is not important, we 
can neglect it. The resulting scalar approximation is described by the equation 
3 u 

j—=Au (12.37) 


where the operator A is 




The above reproduces a simple theory summarized in the introductory section. 
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We finish this section with the remark that similar equations can be derived for the 
magnetic field, known as the H-formulation (consult [9] for more details). 


12.2.5 Finite-difference (FD) approximations 


In this section we provide finite-difference discretization of the above equations. Due to its 
simplicity, the FD is the popular numerical method of solving such problems. One replaces 
continuous space by a discrete lattice where all fields are determined at the lattice’s points. 
The lattice points are defined by x, = i ■ Ax and yj = j ■ Ay, see Fig. 12.4. Some details 
of discretization are provided in Appendix 12A. The final discrete form of the previous 
equations for E-formulation is [9] (here we made a replacement e = n 1 ) 


A, 


1 f Tj j + \u x (i, y+1)-[2- R iJ+1 - Rt^j] u x (i, j ) + T^juJi - 1, J) 
2/3 [ Ax 2 


. u x (i, j + 1) - 2 u x (i, j ) + u x (i, j - 1) r 

+ -v- + h 




(12.38) 


J_ | T i+ iju x (i + 1,7) - [2 - Rj+ij - u y (i, j ) + T Uj _ x u y {i, j - 1) 

= 2P | 

u y (i +1,7)- 2 Uy(i, 7) + u y (i -1,7) 


Ax 2 


- + [s iJ , k -p 2 ]k 2 Uy(i,j)\ (12.39) 


- (+Sr - 0 uAI ~ 1J+,)+ (+£f “ ') uAI ~ '■ 


u x (i+ 1,7-1) 

7-l)J (12.40) 


A, 


1 | ( et+U+U 

SfiAxAy { \ Si, j+1 ,k 



- — 1J u x (i + 1,7 + 1) — (' 
[ (» + l,7-l)+( e, '~ W ~ U 


In the previous equations 



Ri+ij = T mj - 1 


1,7+1) 
>•-1,7-1)] (12-41) 


(12.42) 


(12.43) 
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are the transmission and reflection coefficients across index interfaces between points i and 
*+l. Similarly, 

T uj ± i = 2SiJ± '' k (12.44) 

£ i,j±l,k + s i,j,k 

Ri,j ±i = T u±l - 1 (12.45) 

are the transmission and reflection coefficients across index interfaces between points j and 
7+1. 


12.3 The 1 + 1 dimensional FD-BPM formulation 


If we can neglect y-dependence of the refractive index (wide waveguides), the scalar 
formulation described by Eq. (12.37) can be further simplified to the 1 + 1 description. 
Cross-sectional dependence of the refractive index is n = n(x, z). The relevant Helmholtz 
equation is [10] 

2JP Yz = 9 ? + ~ ^ “ (12 ' 46) 

where u is the only electric field component of the TE mode of the waveguide. This equation 
will now be analysed using two methods. The second method will be implemented. 

12.3.1 Simple approach 


Here we discuss the FD-BPM formulation as described by Chung and Dagli [10]. First, 
replace continuous field u(z, x ) by its discrete values as 

Ui = u(i-Ax,z), i = 0,l,2...N-l 


Second derivative is approximated 


d 2 U _ M,_i - 2 Ui + u i+ 1 

a? - a? 

The resulting finite difference equation is 


" 9z Ax 2 

Integrate the equation using trapezoidal rule 


'-+ (n 2 k 2 0 - f) 2 ) Ui=f(z) 


and have 




f(z)dz 


2 jf> [Ui(z + A z) - u t {z)] = - A z [/J(z + Az) + f(z)] 
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Using the definition of f(z) in the above formula and combining relevant terms, one finds 

[ 10 ] 

—aui +1 (z + Az) + but (z + Az) — aw,_i (z + Az) = aUj + i (z) + cuj(z) + aw,_i (z) 

(12.47) 

where 

Az 

“ “ 2A? 

b = ^ - * m ^( z + Az ) - ^ 2 ]+ yp 

C = - J ^+ l 2 Az [^Hy-P 2 ]+2jP 

The scheme results in a tridiagonal system of linear equations. For more details, see [10]. 


12.3.2 Propagator approach 


Approach 1 


When one approximates propagation within the waveguide by a one-dimensional approach 
using variables (z, x) withz as the propagation direction, they dependence is irrelevant and 
one then sets 3/3 y = 0. The resulting equation is 




\ + p 2 )u 


where k = nko. 

The finite-difference discretization of Eq. (12.48) in ID is the following: 


1 - 2«? + iif_, 

= 2/1 ~Kx 2 


+ 2 ^> 


(12.48) 


(12.49) 


where kf = n 2 (x) = n 2 {Xi). With an introduction of the propagation operator P, the 

above scheme is 


- = E« 


The matrix elements of the operator P are 




The solution of Eq. (12.50) can be written as 


where if" is a column vector which consists of elements u", I is the identity matrix and 
P is the propagation matrix operator having elements given by Eq. (12.51). The scheme 
is known as the explicit scheme. It is numerically unstable if the h step is too large. 
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The improvement is made by applying operator P to the future value of u [11]. From 
Eq. (12.50) 

= jy iklf +x ( 1 2 . 53 ) 

or 

~u " +l =~u n - jh ~u n+l 


or 

(T + jh*?)-fr n+x = ~Z n 

The solution is 

H n+ ' = (T + jh*?y X ~u n (12.54) 

The scheme is called implicit method, and it is stable, see Problem. 

The combination of the above two methods is created by taking the average between 
implicit and explicit schemes. The new scheme is known as the Crank-Nicolson method 
[11], It is both more accurate and stable. Adding Eqs. (12.50) and (12.53), one obtains (the 
Crank-Nicolson scheme) 


u n +1 _ u n « 

2 ;'^ r ^ = X > «+< +1 


=-$ n -\jh¥ {u n k + u n + v ) 

The above can be expressed as 

(r + Tf n+i m(r- -t” 

or 

L+ !t n+1 = Z_ lt n 


L+ = r+\jh y 


and 


1 . 


L_= I --jhP 

From that equation, the final form which can be implemented is 
~u n+1 = IT 1 L_~ti n 


(12.55) 

(12.56) 

(12.57) 

(12.58) 

(12.59) 

(12.60) 


(12.61) 
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Approach 2 

The above derivation can be made more formal. For that purpose, write Eq. (12.48) as 

j d Y= Pu (12.62) 

az 

where the propagation operator P is defined as 

P=^[d 2 x + tf-p 2 )} (12.63) 

The solution of Eq. (12.62) is 

uiz) = e~ pp u{ 0) 

or, when one introduces ‘small’ step h: 

u(z + h) = e- jhP u(z) (12.64) 

For a small step, the solution (12.64) can be approximated as 

u(z + h) « (1 -jhP)u(z) (12.65) 

One might notice that the solution (12.64) can also be written as 

e> hP u(z + h) = u(z) (12.66) 


or 


(1 +jhP)u{z + h)^ u(z) 

(12.67) 

u(z + h) « (1 + jhP)~ l u(z) 

(12.68) 

To solve this problem, a practically important method is known as the Crank-Nicolson 
scheme which is obtained from the solutions (12.65) and (12.68). One thus makes the 
following manipulations: 

*(*+»> = (■+!'’)" 4+*) 


= K p )" ('+§> 

(12.69) 


using Eq. (12.65) for h —* h/2. The implementation of this scheme is shown in Appendix, 
Listing 12A.2. It is used to propagate a Gaussian pulse in an empty space. In our im¬ 
plementation we assumed that f) = n ■ k(>. With this choice, last term in propagator P 
vanishes. 

Profiles of a Gaussian pulse at various locations along the propagation direction in a 
free space are shown in Fig. 12.7. One can observe reduction of a peak value and also the 
effects of reflections from the boundaries of the computational window. Elimination of those 
reflections will be discussed in the next sections. In Fig. 12.8 we show a three-dimensional 
view of profiles of Gaussian pulses shown in Fig. 12.7. 
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Profiles of Gaussian pulses at various locations along z-axis. 



| Three-dimensional view of profiles shown in Fig. 12.7 of Gaussian pulses propagating in a free space. 


Approach 3 

Before starting the discussion of boundary conditions, we want to summarize yet another 
possible numerical approach to Eq. (12.48). It will then be used to derive details of trans¬ 
parent boundary conditions. 

Assuming an equidistant step along x-axis and using standard formulas (h = Az) 


du _ wf +1 - u" 
dz h 


(12.70) 


3 2 u Ui+i — 2 m , + Uj—i 

a? = A? 


(12.71) 
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the discretized version of Eq. (12.48) is 

= (12.72) 

Introduce modifications on the right hand side of Eq. (12.72): 

u i+ i = ^(</+«? +I ) ( 12 ' 73 ) 

= 2 «/+“?-.) (12-74) 

[*?(«+!)+*?(«)] (12.75) 

After new algebraic steps, Eq. (12.72) can be expressed as 

«r'+f {b h «>' - 2 "? + ‘ + W<» ■+»■- fl < + '} 

*■“? - y { ^ X? (u? « ■ 2 “ ; + “W + ^ W <n) - “f} 02.76) 

Note the different« dependence in kf. The final form of the above in a condensed form is 
L + (« + l)lf n+1 = T_ {n)~u n (12.77) 

where 

i+ (* + mt"' = „ r . + 2| j_L_L (< ;_ 2<+ . + „._ + . ) 

+ ^[^(«+ 1) -yS 2 ]< +1 ) (12.78) 


L_(n)Tr n = < - y - 2< + «?_i) + [*?(n) - £ 2 ] <J (12.79) 

12.3.3 Transparent boundary conditions 


To eliminate reflections shown in the previous example, one needs to introduce appropriate 
boundary conditions outside the computational window. The popular solution is to introduce 
the so-called transparent boundary conditions (TBC) which were invented by Hadley [12], 
[13]. TBC have been extensively discussed in the literature [14], [15], [16], [17]. 

Consider the system with left and right boundaries as shown in Fig. 12.9. Nodes xo and 
x,v + 1 are outside the system; however, they are needed in the implementation of numerical 
scheme. 
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i=0 1 2 3 

o--»-♦-• 


N-l N 



N+l 

■ O —> x 


Numbering of nodes used in deriving transparent boundary conditions. 


Left-hand boundary 

Analyse the left boundary first. Assume that near the boundary the field is approximated as 


m(x, z) « 

A(z) e ikxX 

(12.80) 

For the first left points shown in Fig. 12.9, one obtains 


Mo = M(x 0 ) : 

= A(z) e ikxX0 

(12.81) 

Ml = M(Xl) : 

= A(z) e ikxX ' 

(12.82) 

«2 = U(X 2 ) ■■ 

= A(z)e ikxX2 

(12.83) 

From the above equations one finds 



M2 e lkxX1 

Mi e ik * x ' 

*Ax2-x,) _ e ik x Ax 

(12.84) 

and 



Uq e ik * x ° 

lk x (x,-X0) - e ik x Ax 

(12.85) 


assuming uniform grid; i.e. Ax = xi — x\ = x\ — xo. From the above equations, one 
determines value of the field at the outside point xq: 


uq = mi e lkxAx (12.86) 


Wave number k x is determined from known fields using Eq. (12.84): 


(12.87) 


For a field travelling leftward (an outgoing wave), the real part of k x , Re(k x ) is positive. 

The implementation of TBC at the left point for operator U+ x will be illustrated now. 
Consider the first element for i = 1, which from Eq. (12.78) is 


L n + X ( 1) = u n + x + - jh- - l —(-2)u n + x + - jh- -4«n +1 

+ W 1 2 J 20 Ax 2 V 1 ~ 2 J 2fi Ax 2 0 

Replacing Mq + 1 using Eq. (12.86) gives 

in.) = + 

= (old term + left correction) m" +1 
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The last expression is directly implemented. For operator L" +1 there is only a change of 
sign. Similar steps can be taken for the right boundary, which is left as a problem. 


Right-hand boundary 

At the right-hand boundary, the point x,y+1 is outside the system and the field there must 
be determined. One starts with assuming the following expression for the right-travelling 
field: 

u(x) « A(z) e-** (12.90) 

Based on this assumption, we can write fields at three points around the right boundary: 

u N -i = wfev-i) = A(z) e ~ ik * XN ~' (12.91) 

u n = u(x n )=A ( z ) e*** (12.92) 

mjv+i = u(x jv + i ) = A(z) e ~ lkxXN+ ' (12.93) 

Here u N _\ and u N are fields within the system and u N+ \ is outside. From the above relations 
one determines the outside field as 


Un+ 1 = U N 


(12.94) 


where the wavenumber is 


(12.95) 


Cross sections are shown in Fig. 12.10. One can observe that reflections were eliminated. 
The three-dimensional view of an initial Gaussian pulse at various positions is shown in 
Fig. 12.11. MATLAB code is shown in Appendix, Listings 12B.3 and 12B.3.1. 


12.4 Concluding remarks 


A few years ago an article was published with the title ‘What is the future for beam 
propagation methods?’ [18]. The role played by BPM in the simulations of photonic 
devices was discussed. The authors also provided an extensive literature summary on the 
following issues associated with the BPM: transverse discretization, explicit and implicit 
formulations, vector effects, wide angle effects, reflective schemes and time domain BPM. 
They concluded with highlighting the outstanding issues (as of 2004) of BPM, which 
were: flexible meshing, development of wide-angle schemes, improvement of boundary 
conditions and creation of hybrid BPM schemes, e.g. with time and/or frequency domain 
methods. 

A more recent assessment of BPM (and also Eigenmode Expansion Method and Finite 
Difference Time Domain) for Photonic CAD is reported by Gallagher [19]. He summa¬ 
rizes the main algorithms used in photonic modelling and discusses their strengths and 
weaknesses. 
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Comparison of field profiles at various locations along z-axis of Gaussian pulse propagating in a free space by BPM with 
transparent boundary conditions. One observes reduction of its peak value and spreading of pulse. No reflections are 
observed. 



Three-dimensional view of Gaussian pulse shown in Fig. 12.10 propagating in a free space by BPM with transparent 
boundary conditions. 


12.5 Problems 


1. Develop H-formulation of BPM. 

2. Perform von Neumann stability analysis for the implicit method. 

3. Based on the example in the text for TBC for the left boundary, develop expressions 
for the right TBC. 
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12.6 Project 


1. Implement 1-1 FD-BPM as discussed in Section 12.3.1. Consult work by [10]. Analyse 
waveguides described in Ref. [10]. 


Appendix 12A: Details of derivation of the FD-BPM equation 


We describe details of discretization of the earlier equations, like Eq. (12.32). We present 
details for the ID case only. Generalization to 2D is straightforward. A typical term which 
needs discretization is 


Simple approach 

First-order derivative is 

90 _ 4>p+i - 4> P 

dx ~ Ax 

where <j> = eu. Applying the above formula with difference centred around the points p+\ 
and p—\, one finds 

K _ J_ I 1 <t>p+ i - 0 P _1 <Pp ~ <Pp -1 

Ax I i Ax s„ i Ax 
l P+2 P -2 

where 


<f)p±\ = £p±iUp±\ 

<j)p = s p u p 

We also use the following intuitive relations (which will be proved in the next section): 

_ Bp+ 1 + S p 

b P+2 2 

Sp +1 + Ep-I 
E p-\ - 2 

Combining the above relations, one obtains 

K _ J. 2 £ p+ \u p+ \ - SpUp _ 1 2 s p u p - Sp-iUp-1 

Ax £p +1 + £p Ax Ax £p + £p-\ Ax 



\ *P + £ P } 

1 £pl ] 

1 Ep+ 1 + £p ^ 

L e £+1 + E p e p+ s p- 1 J 

Sp + Ep— i J 






















309 


Appendix 12A 


Taylor series approach 

We differentiate 

319 31 / ds\ 3 2 u 

dx e dx dxe \dx J dx 2 

First term is evaluated as follows. First, we expand around point p: 


[ (?)“) = ('(r)“) ± t f'(r) 

s y dx J ) I \e\dxj J p 2 dxe\d x) 


Subtracting gives 


c;©{-i|(;(S)-) H -0(E)-) H | «“> 

From the above, one can observe that the values of u and e are needed at intermediate points 
p + \ and p—\- Those can be obtained by performing expansions 


du I Ax 

Up+l + aiLi T" 


du I 

Up¥ *~ Kl 


Adding the above gives 


Expansions of e gives 


Adding the above gives 


Subtracting gives 


e p +1 — e p+| 


3 s I Ax 

£p = £ p+k- feLiT 


= -(fip + Ep+ 1) 


— =-(e p+l -e p ) 

dx\ p+l 2 
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From Eqs. (12.98) and (12.99) we have 

1 3e I _ 2 Sp+ 1 - s p 
e dx l^i Ax s p + Sp+\ 

Using Eqs. (12.97) and (12.100), we obtain 

1 de I _ 1 s p+ 1 - s, 
sdx \ p+ i A xsp + sp+i 


i u P "F Up+ 1) 


In exactly the same way, performing the relevant expansions around point p — \ one 
finds 

~«| -- J_ £ p S p~ x (Up + Up-i) (12.102) 

S OX \p_i Ax Sp + Sp -1 

Substituting Eqs. (12.101) and (12.102) into (12.96), we obtain an intermediate result 

£j ’~ 1 («p + »P-|)| (12.103) 

9x e \ 9x/ p Ax- | + e p+ \ s p + e p _i J 

A complete derivative is obtained by combining discretization for a second derivative 


with expression (12.103). The result is 
9 1 9 

K = 

dx S dx 


AxM s p + Ep+l p+l \s p + s p+1 " e p + Bp¥l J 

The last result is the same as the one obtained by Lidgate [20], 


Appendix 12B: MATLAB listings 


In Table 12.1 we provide a list of MATLAB files created for Chapter 12 and a short 
description of each function. 


Table 12.1 List of MATLAB functions for Chapter . j 

Listing 

Function name 

Description 

12B.1 

12B.2 

12B.3 

12B.3.1 

pbpm.m 

f d bpm f'ree.m 

bpmJbc.m 

Gaussian pulse in free space in the paraxial approximation 
Gaussian pulse in free space (Crank-Nicholson) 

Transparent boundary conditions 

Function called by bpm Jbc.m. Performs single step 
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Listing 12B.1 Function pbpm.m. Function performs propagation of a two-dimensional 
Gaussian pulse using the Fourier transform split-step beam propagation method in the 
paraxial approximation. 

•/. File name: pbpm.m 

% Propagation of 2D Gaussian pulse by 
clear all N= 10; delta_x = 1/N; 
x = -5:delta_x:5; % 

y = -5:delta_x:5; % 

delta_z =5.0; % 

delta_n = 0.4; k_zero = 100; 
beta = 20; % 

7. 

u_init=(exp(-x.~2))’*exp(-y.“2); % 

mesh(x,y,abs(u_init)); % 

pause close all 

7. 

k_x = -5:1/N:5; % 

k_y = -5:1/N:5; % 

7. 

temp = delta_z/(2*beta); H_transfer = 

(exp(li*temp*k_x.~2))’*(exp(li*temp*k_y.~2)); H_transfer = 
fftshift(H_transfer); S_phase = exp(-li*k_zero*delta_n); 

7. 

for n=l:100 

z = fft2(u_init); 
zz = z.*H_transfer; 
u_prime = ifft2(zz); 
u = S_phase.*u_prime; 
u_init = u; 


paraxial FT split-step BPM 

creation of space arguments 
creation of space arguments 
step size along z-axis 

propagation constant 

initial Gaussian pulse 
plots original pulse 

creation of Fourier variables 
creation of Fourier variables 


mesh(x,y,abs(u_init)) 7. plots pulse after propagation 

pause close all 


Listing 12B.2 Program fd_bpm_free.m. Illustrates propagation of a Gaussian pulse in a 
free space. 

7. File name: fd_bpm_free.m 

l Propagation of Gaussian pulse in a free space by Crank-Nicholson method 
l No boundary conditions are introduced 
clear all 

L_x=10.0; 7. transversal dimension (along x-axis) 

w_0=1.0; 7. width of input Gaussian pulse 

lambda = 0.6; 7. wavelength 
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n=l.0; 

k_0=2*pi/lambda; 

N_x=128; 

Delta_x=L_x/(N_x-l); 
h=5*Delta_x; 

N_z=100; 

plotting=zeros(N_x,N_z); 
x=linspace(-0.5*L_x,0.5*L_x,N_x); 


7. refractive index of the medium 
*/„ wavenumber 
7. points on x axis 
% x axis spacing 
"/, propagation step along z-axis 
*/„ number of propagation steps 
"/, storage for plotting 
% coordinates along x-axis 


E=exp(-(x/w_0) . *2) ; °/ 0 initial Gaussian field 

7. 

’/, beta = n*k_0. With this choice, last term in propagator vanishes 
prefactor = 1/(2*n*k_0*Delta_x~2); main = ones(N_x,l); above = 
ones(N_x-l,1); below = above; 

P = prefactor*(diag(above,-l)-2*diag(main,0)+diag(below,l)) ; 7, matrix P 


step_plus = eye(N_x) + 0.5i*h*P; 7o step forward 
step_minus =eye(N_x)-0.5i*h*P; 7o step backward 

7 . 

z = 0; z_plot = zeros(N_z); for r=l:N_z 
z = z + h; 
z_plot(r) = z + h; 
plottingC:,r)=abs(E).“2; 
E=step_plus\step_minus*E; 


for k = l:N_z/10:N_z 7» choosing 2D plots every 10-th step 

plot(plottingC:,k),’LineWidth’,1.5) 

setCgca, ’FontSize’, 14) ; 7» size of tick marks on both sixes 

hold on 

end pause close all 

7. 

for k = l:N_z/10:N_z 7o choosing 3D plots every 10-th step 

y = z_plot(k)*ones(size(x)) ; 7o spread out along y-sixis 

plot3(x,y.plottingC:,k),’LineWidth',1.5) 
hold on 

end grid on xlabelC’x (mm)’,’FontSize’,14) 

ylabelC’z (mm)’,’FontSize’, 14) 7o along propagation direction 

setCgca,’FontSize’, 14); 7o size of tick marks on both sixes 

pause close all 


Listing 12B.3 Program bpm_tbc.m. Illustrates propagation of Gaussian pulse in a free 
space with transparent boundary conditions. 

7. File name: bpm_tbc.m 

7. Illustrates propagation of Gaussisin pulse in a free space 
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/o using BPM with transparent boundary conditions 
l Operator P is determined in a separate function 
clear all 


L_x=10.0; 
w_0=1.0; 
lambda = 0.6; 
n=l.0; 

k_0=2*pi/lambda; 

N_x=128; 

Delta_x=L_x/(N_x-l); 

h=5*Delta_x; 

N_z=100; 

plotting=zeros(N_x,N_z); 
x=linspace(-0.5*L_x,0.5*L_x,N_x); 


”/„ transversal dimension (along x- 
% width of input Gaussian pulse 
'/, wavelength 

"/, refractive index of the medium 
'/, wavenumber 

"/, number of points on x axis 

'/„ x axis spacing 

”/„ propagation step 

"/, number of propagation steps 

'/, storage for plotting 

'/, coordinates along x-axis 


E=exp(-(x/w_0). ~2) ; 

1 

z = 0; 

z_plot = zeros(N_z); 
for r=l:N_z 
z = z + h; 
z_plot(r) = z + h; 
plotting(:,r)=abs(E) ,~2; 

E = step(Delta_x,k_0,h,n,E); 


"/, initial Gaussian field 


’/. BPM stepping 


'/, Propagates pulse over one step 


for k = l:N_z/10:N_z '/, choosing 2D plots every 10-th step 

plot(plotting(:,k),’LineWidth’,1.5) 

set(gca,’FontSize’,14) ; '/, size of tick marks on both axes 

hold on 

pause 
close all 

I. 

for k = l:N_z/10:N_z “/„ choosing 3D plots every 10-th step 

y = z_plot(k)*ones(size(x)) ; '/, spread out along y-axis 

plot3(x,y,plotting(:,k),’LineWidth’,1.5) 
hold on 

grid on 

xlabelC’x’,’FontSize’,14) 

ylabel(’z’,’FontSize’,14) '/, along propagation direction 

set(gca,’FontSize’,14) ; “/, size of tick marks on both axes 

pause 
close all 
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Listing 12B.3.1 Function step.m used by bpmJbc.m. Function performs a single step in 
BPM. 

7. File name: step.m 

function E_new = step(Delta_x,k_0,h,n,E_old) 

7 Function propagates BPM solution along one step 

N_x = size(E_old, 1) ; '/, determine size of the system 

7.-Defines operator P outside of a boundary 

prefactor = l/(2*n*k_0*Delta_x~2); 
main = ones(N_x,l); 
above = ones(N_x-l,1); 
below = above; 

P = prefactor*(diag(above,-l)-2*diag(main,0)+diag(below,1)); 7. matrix P 

7. 

L_plus = eye(N_x) + 0.5i*h*P; 7. step forward 

L_minus = eye(N_x)-0.5i*h*P; 7. step backward 

7. 

7«-Implementation of boundary conditions 

7. 

pref = 0.5i*h/(2*k_0*Delta_x~2); 
k=li/Delta_x*log(E_old(2)/E_old(l)); 
if real(k)<0 

k=li*imag(k); 

end; 

left = pref*exp(li*k*Delta_x); 7. left correction for next step 

L_plus(l) = L_plus(l)+left; 

L_minus(l) = L_minus(l)-left; 

7. 

k=-li/Delta_x*log(E_old(N_x)/E_old(N_x-l)); 
if real(k)<0 

k=li*imag(k); 

end; 

right = pref*exp(li*k*Delta_x); 7, right correction for nest step 

L_plus(N_x) = L_plus(N_x) + right; 

L_minus(N_x) = L_minus(N_x) - right; 

7. 

E_new = L_minus\L_plus*E_old; 7 determine new solution 
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Some wavelength division multiplexing 
(WDM) devices 


Wavelength division multiplexing (WDM) is a modem practical method of increasing 
transmission capacity in fibre communication systems. It uses the principle that optical 
beams with different wavelengths can propagate simultaneously over a single fibre without 
interfering with one another. In the wavelength range of 1280-1650 nm (like an AllWave 
fibre [1]) the useable bandwidth of a single mode fibre is about 53 THz. In recent years an 
improved (denser) WDM system known as DWDM is under development. 

In this chapter we discuss some of the WDM devices and also provide some applications 
of BPM developed earlier to simulate those devices. We start by summarizing the basic 
WDM system. 


13.1 Basics of WDM systems 


WDM is the main technique used in the realization of all optical networks. WDM is the 
technology which combines a number of wavelengths onto the same fibre. 

Key features include: 

• capacity upgrade 

• transparency (each optical channel can carry any transmission format) 

• wavelength routing 

• wavelength switching 

Implementation of a typical WDM system employing N channels is shown in Fig. 13.1. 
In the shown system three wavelengths are multiplexed in one fibre to increase transmission 
capacity. The light of laser diodes with wavelengths recommended by the ITU is launched 
into the inputs of a wavelength multiplexer (MUX) where all wavelengths are combined 
and coupled into a single-mode fibre. When needed, propagating light can be amplified by 
an optical fibre amplifier and eventually imputed at the wavelength demultiplexer (DMUX) 
which separates all optical channels and sends them to different outputs. 

In order to find the optical bandwidth corresponding to a spectral width in optical region, 
we use the relation c = X ■ v, where X is wavelength and v carrier frequency and c velocity 
of light. Differentiating 
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Multiplexer 



Implementation of a typical WDM link. 


Optical fibre 


Demultiplexer 



or 

|Ait| = £|AA| (13.1) 

The above equation describes the frequency change Av which corresponds to the wave¬ 
length change A7. around A.. Using the above formula, we can estimate the usable wavelength 
range for a standard single-mode fibre. Assuming that telecommunication wavelength range 
extends from A] = 1280 nm to A2 = 1625 nm, the ultimate bandwidth of optical fibre is 
40 THz [2], Assuming 50 or 25 GHz channel spacing, there is the possibility to transmit 
800-1600 wavelength channels. 

In the remainder of this chapter, we will consider some of the basic devices used in WDM 
systems and provide two applications of the beam propagation method (BPM) to simulate 
simple structures. 


13.2 Basic WDM technologies 


In an ideal WDM system where nonlinear effects are neglected, the discrete wavelengths 
can be optically processed, i.e. routed and/or switched without interfering with each other. 
Optical processing can be performed using passive or active components. 

One refers to passive components where there is no external control of their operation. 
Typically, they are used to split or combine signals. Examples of passive components are: 
N x N couplers, power splitters and star couplers. 

Active components, which can be controlled electronically, include tunable optical filters, 
tunable sources, optical amplifiers. Here, we will concentrate on passive components. 

Basic technologies which are used to develop WDM devices are [3]: fibre Bragg grating 
(FBG), array waveguide grating (AWG), thin-film filter (TFF) and diffraction grating (DG). 

Another classification of main WDM components used in fibre optic communication 
systems is described by Agrawal [4] and includes: tunable optical filters (Fabry-Perot 
filters, Mach-Zehnder filters, grating-based filters, application-based filters), multiplexers 
and demultiplexers, add/drop multiplexers and filters, broadcast star couplers, wavelength 
routers, optical cross-connects, wavelength converters and WDM transmitters and receivers. 

In the following, we will discuss basic WDM technologies. We start with fibre Bragg 
grating. 
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K h h 
K 



13.2.1 Fibre Bragg grating 


Fibre Bragg grating (FBG) devices are based on the principle discovered by Bragg in 1913 
and demonstrated in fibre by Hill in 1978 [5], It is illustrated in Fig. 13.2. 

In a fibre’s core a periodic change of the refractive index (known as grating) is created 
using the effect of photosensitivity in germanium-doped optical fibre [5], When a light 
signal consisting of several wavelengths travels in optical fibre (here from left to right), 
the signal with a wavelength which obeys the Bragg condition is reflected. The reflected 
wavelength centred at Xb that fulfills the Bragg condition is 

X B = 2A n eff (13.2) 

where A is the grating period and n e /f is the effective group refractive index of the core. 

Applications of Bragg grating are numerous and include filters and dispersion compen¬ 
sation. A comprehensive review on FBG including fibre grating lasers and amplifiers was 
written by Kashyap [6]. 


13.2.2 Array waveguide grating 


Array waveguide grating (AWG) is shown schematically in Fig. 13.3 [2], It is formed by 
several waveguides with different lengths. At both ends all waveguides are converging to 
the same points. Light composed of several wavelenths {X\,X 2 ,...) enters the device on 
the left. The length of each waveguide is carefully designed so to provide precise phase 
difference between neighbouring waveguides at the end of the guiding structure (on the 
right). At the output, a diffraction pattern is created which allows output waveguide to 
collect light at a particular wavelength. AWG can be used as a passive optical multiplexer 
and/or demultiplexer. 
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0.5 P fa 
0.5 P ta 



A four-port splitter. 





Integrated optics directional coupler. 


Output 

] - *2 


Output 
-► 3 


13.2.3 Couplers and splitters 


Optical couplers are passive devices which either split optical signal into multiple paths, or 
combine several signals into one path. A prime characteristic of couplers is the number of 
input and output ports, which is typically expressed as an N x M configuration, where N 
represents the number of inputs and M represents the number of outputs. 

A splitter is a modification of a coupler. As an example, a Y-branch power splitter is shown 
in Fig. 13.4. Typically, such Y-branch splits power evenly between the two output ports. By 
combining several Y-branches more outputs can be provided, as shown in Fig. 13.5. 

Couplers can be constructed within the waveguide structure as shown in Fig. 13.6. Such 
a structure can be formed by ion exchange, ion implantation or chemical vapour deposition. 

Couplers can also be constructed by twisting two or more fibres together and melting 
them in a flame. The process creates a fused coupler, a very popular device. 

These techniques can be used to make 50-50% or 99-1% couplers. The numbers 
indicate splitting ratios. The length of the coupling region (fused region) as well as the 
twisting determines the splitting ratio. Such couplers are simple to construct but fabrication 
requires significant experience. 

In the coupler shown in Fig. 13.6, the optical signal enters port 1. Some of its power exits 
in port 2 and some in port 3. In an ideal situation, no light reaches port 4 and also no power 
is lost. In practice, however, a few tenths of a dB are lost and the coupling to port 4 is about 
30 dB relative to input power at port 1. The percentage of light coupled to different ports 
can be varied by changing the coupling length L. 
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b 



Two coupled optical waveguides. Distribution of electric fields is also shown. 

Some of the common applications of couplers and splitters are 

• to monitor output of light locally (usually 99—1% couplers are used) 

• to distribute an incoming signal to several locations simultaneously. For example, 
four-part splitter (Fig. 13.5) allows a signal to drive four receivers. 

Next, we provide a summary of the mathematical description of a passive coupler. 


13.2.4 Mathematical theory of a passive coupler 


Consider two waveguides ‘a’ and ‘b’ (Fig. 13.7), see [7], For a single waveguide, say ‘a’, 
one can express the field as 


E (x, y, z) = E la) (x, y) a (z) 
H(x,y,z) = H ta) (x,y) a (z) 


and where 


a (z) = a 0 e !Az 


where E ia) (x, y) and H (a) (x, y) are modal distributions in (x, y) plane. They are normalized 



Also 



Total guided power 



= \a(z)\ 2 

For two parallel waveguides, fields in each one are written as 


S(x,y,z) = a (z) (x,y) + b (z) (x, y) 

H (x, y,z) = a (z) H w (x, y) + b (z) H (b) (x, y) 


(13.3) 


(13.4) 
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Amplitudes a (z) and b (z) satisfy the following (coupled-mode) equations: 
da(z) 
dz 

db(z) 


- = ip a a(z) + iKabb(z) 


= iK ha a(z) + ip h b(z) 
where K ab and K bu are coupling coefficients. Guided power is 

P = s a \a (z )| 2 +s h \b (z) | 2 + Re {a (z) b* (z) C ha + b (z) a* (z) C ah \ 


c M=\f / E <q] (*. t) X ( *, jO • xtxdy 


In the above, s a , s b = +1 is for propagation in the +z direction and s a , s b = 
propagation in the -z direction. 

Coupled mode equations can be written in a matrix form as 

iKoHEa 


The solution is assumed to be 


# = r 

|_ K ba fib J 


EM- 


After substitution into Eq. (13.9), one finds 


p-,T][^o 


where 1 is an identity matrix. In the full form 
r Pa~P Kab 1 


For non-trivial solutions, a determinant must vanish: 


det = (/? a - ft) (P b - P) - K ab ■ K ba = 0 
From the above, two eigenvalues are found as 

P = \(Pa + Pb)±Y- {£ 

where 

Y = i/A 2 + Kab-Kha, & = ^ (Pb ~ Pa) 


(13.5) 

(13.6) 

(13.7) 

(13.8) 
— 1 is for 

(13.9) 

(13.10) 

(13.11) 

(13.12) 

(13.13) 

(13.14) 

(13.15) 

(13.16) 
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LA - y. 

The general solution is therefore 


* = [a + v] or [‘P 


for ft- 


where matrix V is formed from eigenvectors as 

y = [Vi; V 2 ] 

After some algebra, one finds the final solution 

b:]- t »[s] 


T (Z ) 


|~cos yz - 


-lysmyz i-y sin yz 
sin yz cos yz + i A sin yzj 


(13.17) 

(13.18) 

(13.19) 

(13.20) 

(13.21) 

(13.22) 


As a special case, consider a situation when at z = 0 the optical power is incident only 
in waveguide 1, a (0) = 1, b (0) = 0. One finds in this case 

11 . \ 12 I I ■ 2 

|o(z)| = — sin yz 
I Y I 

At yz = , 3 j ...., (2 n +1) the power transfer from guide ‘a’to guide ‘b’ is maximum. 

Since 

I ^ba I I K ba V ^ ^ 

U(Pb- Pa)] 2 + Wba\ 2 < 

for p a ± p b the power transfer between waveguides is never complete. For more discussion 
consult Chuang [7], 


13.2.5 Optical isolators 


An optical isolator is a passive device which allows the transmission of an optical signal 
in only one direction. At the same time the reflections in the opposite direction will be 
eliminated. 

Two key parameters of an isolator are: the insertion loss which is the loss in the forward 
direction, and its isolation which is the loss in the reverse direction. Typical insertion loss 
is about 1 dB, whereas the isolation loss is 40-50 dB. 
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Illustration of the principle of Faraday rotation. 


An optical isolator based on the Faraday effect is shown in Fig. 13.8. The device consists 
of two linear polarizers and a 45° Faraday rotator. A light beam entering the device from the 
left passes through a linear polarizer which polarizes it vertically. Then it passes through a 
Faraday rotator which rotates it (here, say by 45°) with respect to vertical direction. After 
passing through a rotator, it travels through another linear polarizer aligned at 45° which 
allows the beam to pass through. Therefore, a vertically polarized beam can be transmitted 
through the device. However, all beams polarized at other angles will be blocked. 

The angle of rotation a in a Faraday rotator is given by 


a = VBL 


(13.23) 


where V is the Verdet constant, B is magnetic induction and L is the interaction length. 
For glass (crown) at a temperature of 18°C, the Verdet constant is V = 2.68 x 10 -5 deg/ 
Gauss- mm. 


13.3 Applications of BPM to photonic devices 


The development of new types of components for WDM applications requires accurate 
modelling before any fabrication attempts. Over the years many approaches and numerical 
methods have been developed. They can generally be divided into two main groups [8]: time- 
harmonic (monochromatic CW operation) and general time-dependent (pulse operation). 
The important role in numerical analysis of those components is played by BPM. 

There are many specific applications of the BPM to modelling different aspects of pho¬ 
tonic devices or circuits, such as passive waveguiding devices [9], channel-dropping filters 
[10], multimode waveguide devices [11], polarization splitters [12], multimode interference 
devices [13] and many more. 
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Propagation of Gaussian pulse in a waveguide by BPM. 



j_j Propagation of Gaussian pulse in a tapered waveguide by BPM. 


For a recent summary on integrated optics consult the recent book by Fhinsperger [14]. 

In this section we show the results of applications of the BPM developed earlier to the 
simplest two waveguiding structures which are the building blocks of WDM devices. Here, 
we consider a rib waveguide and a tapered waveguide. In Figs. 13.9 and 13.10 we show 
propagation of a Gaussian pulse and also profiles of the refractive indices. The MATLAB 
code used to generate those figures is provided in the Appendix, Listings 13A. 1 and 13A.2. 

To avoid reflections at the boundaries, transparent boundary conditions were imple¬ 
mented. 
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Appendix 13A 


13.4 Projects 


1. Function yjunction.m provided in the Appendix can be used to create a Y-junction. 
Use this function and other BPM routines to develop a program which will propagate 
a Gaussian pulse in the Y-junction. Analyse various configurations from the literature. 
Consult Refs. [15] and [16]. 

2. Implement the method of finding propagation constants using BPM. Consult Refs. [17] 
and [18]. 

3. Write MATLAB code to analyse propagation of Gaussian pulses in a coupled rib wave¬ 
guide. Determine coupling length. Consult Refs. [19] and [20], 

4. Analyse a fibre Bragg grating filter using BPM. 


Appendix 13A: MATLAB listings 


In Table 13.1 we provide a list of MATLAB files created for Chapter 13 and a short 
description of each function. 


Listing 13A.1 Program bpm wg.m. Describes propagation of a Gaussian pulse in a 
waveguide using BPM. 


% File name: bpm_wg.m 

7. Driver function which propagates Gaussian pulse 

% in a strip waveguide using BPM with transparent boundary conditions 
clear all 

x_0 = 1; 7, center of Gaussian pulse 

L_x=10.0; 7. transversal dimension (along x-aj 

w_0=1.0; 7. width of input Gaussian pulse 

lambda =1.6; 7. wavelength 

k_0=2*pi/lambda; 7. wavenumber 


| Table 13.1 List of MATLAB functions for Chapter . 1 

Listing 

Function name 

Description 

13A.1 

bpm-wg.m 

Gaussian pulse in a waveguide with TBC 

13A.1.1 

wgstruct.m 

Constructs waveguiding structure used by bpm w g.m 

13 A. 1.2 

step.m 

Propagates BPM solution along one step 

13A.2 

bpmJaper.m 

Gaussian pulse in a tapered waveguide with TBC 

13A.2.1 

taper struct, m 

Constructs tapered structure used by bpm,aper.m 

13A.3 

plot-y.m 

Plots Y-junction 

13A.3.1 

yjunction.m 

Function defines Y-junction 
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N_x=128; 

Delta_x=L_x/(N_x-l); 
h=5*Delta_x; 

N_z=100; 

plot_index=zeros(N_x,N_z); 
plot_field=zeros(N_x,N_z); 
x=linspace(-0.5*L_x,0.5*L_x,N_x); 

E=exp(-((x - x_0)/w_0)."2); 
ref_index = wg_struct(x); 
n_eff = 1.40; 


°/„ points on x axis 

% x axis spacing 

"/, propagation step 

*/„ number of propagation steps 

"/, storage for plotting ref. index 

% storage for plotting field 

X coordinates along x-axis 

X initial Gaussian field 

"/, structure is uniform along z-axis 

*/„ assumed value of prop, constant 


z = 0; z_plot = zeros(N_z); for r=l:N_z 
z = z + h; 
z_plot(r) = z + h; 
plot_index(:,r)=wg_struct(x); 
plot_field(:,r)=abs(E).*2; 

E = step(Delta_x,k_0,h,ref_index,n_eff,E); 


for k = l:N_z/10:N_z % choosing 2D plots every 10-th step 

plot(x,plot_index(: ,k) , ’LineWidth ’, 1.2) 
plot(x,plot_field(: ,k) , ’LineWidth 1 ,1.5) 
hold on 

end pause close all 

X 

for k = l:N_z/10:N_z ’/, choosing 3D plots every 10-th step 

y = z_plot(k)*ones(size(x)); % spread out along y-axis 

plot3(x,y,plot_index(: ,k) ,'LineWidth’,1.2) 
plot3(x,y,plot_field(:,k),’LineWidth’,1.5) 
hold on 

end grid on xlabel(’x’) 

ylabel(’z’) X along propagation direction 

pause close all 


Listing 13A.1.1 Program wg_struct.m. Constructs a waveguiding structure. 


function ref_index = wg_struct(x) 

% Construction of the waveguide structure used by BPM 
width = 2.0; 7, film width 


n_c =1.48 
n_s = 1.49 
n_f =1.52 


X refractive index of cover 
X refractive index of substrate 
l refractive index of film 


ref_index = n_s*(x<0)+n_f*((x>=0)&(x<=width))+n_c*(x>width); 
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Listing 13A.1.2 Function step.m is slightly modified from the function of the same name 
used in Chapter 12. 

7. File name: step.m 

function E_new = step(Delta_x,k_0,h,ref_index,n_eff,E_old) 

°/ 0 Function propagates BPM solution along one step 

N_x = size(E_old, 1) ; 7. determine size of the system 

last_term = k_0~2*Delta_x~2*(ref_index.~2 - n_eff~2); 

prefactor = l/(2*k_0*n_eff*Delta_x~2); 

main = ones(N_x,l) - 0.5*last_term; above = ones(N_x-l,l); below = 
above; 

P = prefactor*(diag(above,-l)-2*diag(main,0)+diag(below,l)) ; 7. matrix P 

7. 

L_plus = eye(N_x) + 0.5i*h*P; "/, step forward 

L_minus = eye(N_x)-0.5i*h*P; i step backward 

% 

7.-Implementation of boundary conditions 

7. 

pref = 0.5i*h/(2*k_0*Delta_x~2); 
k=li/Delta_x*log(E_old(2)/E_old(l)); if real(k)<0 
k=li*imag(k); 

end; 

left = pref*exp(li*k*Delta_x) ; °/„ left correction for next step 

L_plus(l) = L_plus(l)+left; L_minus(l) = L_minus(l)-left; 

7. 

k=-li/Delta_x*log(E_old(N_x)/E_old(N_x-l)); if real(k)<0 
k=li*imag(k); 

end; 

right = pref*exp(li*k*Delta_x) ; °/„ right correction for nest step 

L_plus(N_x) = L_plus(N_x) + right; L_minus(N_x) = L_minus(N_x) - 
right; 

% 

E_new = L_minus\L_plus*E_old; 7« determine new solution 

Listing 13A.3 Function plots a Y-junction. 

7. File name: plot_y.m 
7. Plots y-junction 
clear all 
L_x=10.0; 

N_x=100; 


7. computational window along x-axis 
7. number of steps along x-axis 
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L_z = 20; N_z = 100; 
h = L_z/N_z; 

plotting=zeros(N_x,N_z); 


7. step size 

7. storage for plotting 


z=0; 

% 

for i=l:N_z 

z = z + h; 
z_plot(i) = z + h; 
x = linspace(-L_x,L_x,N_x); 
plotting(:,i)=y_junction(z,x,N_x,N_z); 

end 

% 

for k = l:10:N_z ’/, choosing 3D plots every 10-th step 

y = z_plot(k)*ones(size(x)); '/, spread out along y-axis 

plot3(x,y,plotting(:,k),',’LineWidth’,1.0) 
hold on 

end xlabelC’x’); grid on pause close all 

Listing 13A.3.1 Function defines a Y-junction. 

function index = y_junction(z,x,N_x,N_z) 

7. Construction of the Y-junction structure used by BPM 
7« z - coordinate along propagation direction 
7o x - perpendicular coordinate 

7« Structure is described in only 2D, so we do not need cover 

7. 

w = 2.0; 7o film width 

n_s =1.0; % refractive index of substrate 

n_f = 1.52; % refractive index of film 

a = 10; b = 20; 

7» Construction of Y-junction 
slope =0.1; 

% 

7oindex = zeros(N_x,N_z) ; 

upper_l = w/2 + slope*z; upper_2 = slope*z; lower_l = - slope*z; 
lower_2 = -w/2- slope*z; 

7. 

n_l = n_s*(x<-w/2)+n_f*((x>=-w/2)&(x<=w/2))+n_s*(x>w/2); 

n_2 = n_s*(x<=lower_2)+n_f*((x>=lower_2)&(x<=lower_l))+... 

n_s*((x>=lower_l)&(x<=upper_2))+n_f*((x>=upper_2)&(x<=upper_l))+... 

n_s*(x>upper_l); 


index = ((0<z)&(z<=a))*n_l+((a<z)&(z<=b))*n_2; end 
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Optical link 




In the present chapter we will combine some of the methods developed earlier to create 
the simple point-to-point optical simulator which represents the simplest photonic system. 
It involves the transmitter, optical fibre and receiver. Some issues of how to quantify the 
quality of transmission in such a system will also be reviewed. Performance evaluation and 
tradeoff analysis are the central issues in the design of any communication system. Using 
only analytical methods, it is practically impossible to evaluate realistic communication 
systems. One is therefore left with computer-aided techniques. 

In the last 10-15 years the design of photonic systems has moved from the back-of-the- 
envelope calculations to the use of sophisticated commercial simulators, see for example, 
products advertised by Optiwave, like OptiSystem [1], by RSoft Design Group the Optical 
Communication Design Suite [2] and by VPI Photonics line of products [3], to just name 
a few important players. They contain sophisticated physical models and allow for rapid 
assessment of new component technologies in the system under design. 

Around 1995 the design of optical communication systems (operating over medium 
distances) would involve only a balance of power losses and pulse spreading. Later on, 
the demand on billion-dollar systems required complex analysis during the design process. 
This in turn created the need for sophisticated simulators. 

Computer simulations can quickly provide answers to several important questions essen¬ 
tial to every engineer designing optical communication system, like: what repeater spacing 
is needed for a given bit rate, or what is the required power generated by a transmitter? 

There exists extensive literature on the subject of modelling and simulation of fibre optic 
communication systems. Early extensive work on the description and calculation of the 
transmission properties of optical fibres for communications was published by Geckeler 
[4], There were several books published by van Etten and van der Plaats [5], Einarsson [6], 
Liu [7], Lachs [8], Keiser [9] and Palais [10], to list a few. 

Literature published in journals is also extensive. Some of the relevant journal publica¬ 
tions are: [11], [12], [13], [14], [15], [16], [17], [18], [19], [20]; that includes statistical 
design of long optical fibre systems is [21], A general review intended for wide audience 
was written by Lowery [22], 


14.1 Optical communication system 
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A generic optical link consisting of a transmitter, optical fibre and receiver was shown in 
the Introduction, Fig. 1.1. In that figure we showed a transmitter which employs a laser 
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Transmitter Channel Receiver 



More realistic optical link. 


Optical receiver Optical transmitter 



Amplifier 


Photonic regime Electronic regime Photonic regime 


Principle of regeneration of optical signals. 0-E means optical to electronic; E-0 means electronic to optical. 


diode to generate light, a channel which is formed by an optical fibre and a receiver whose 
main element is a photodetector. Actual systems are more complicated. 

In Fig. 14.1 we show a block diagram of a digital lightwave system (after Lima et al. 
[23]). Several detailed elements of the receiver are displayed. Some of them were already 
discussed in the earlier chapters. In the figure we also show notation for signals used to 
characterize operation of the particular elements. 

Sequence A k denotes the data values which are independent and identically distributed, 
Hf{co) is the transfer function of the optical fibre and B k is the output data sequence created 
hy the receiver. 

When light is transmitted over long distances, optical pulses degenerate due to various 
effects discussed previously. As a result, maximum transmission distance is limited. In 
order to increase that distance, repeaters are installed. Typical structure of the repeater is 
schematically shown in Fig. 14.2. An optical pulse is converted into electrical form, then 
amplified and regenerated and then converted back to light. 

Such a process is costly and has also other limitations. Intensive research is conducted 
to have this process done entirely in the optical domain without conversion to an electronic 
regime. 

Regenerators are devices which regenerate signals by removing noise and distortion after 
amplification. The process is normally possible only in digital systems. It serves 2R or 3R 
purposes. In general, the following terminology has been established: 

• 1R (Re-amplification only) - signal is amplified only. 

• 2R (Re-amplification and Re-shaping). Re-shaping corrects a pulse’s shape. 
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Table 14.2 Typical values of losses for various components. j 

Type of element 

Losses [dB] 

Connector 

0.1-0.2 

Splice losses 

0.3 


• 3R (Re-amplification, Re-shaping and Re-timing). Re-timing corrects the time drift in 
an optical pulse. 

As a result of the regeneration process, clean digital pulses are created. The distance 
between repeaters varies, typically 50-70 km. 


14.2 Design of optical link 


There is a vast amount of literature on the design of optical communications links. Some of 
the useful papers and books are: Carnes et al. [24], Pepeljugoski and Kuchta [25], Iannone 
et al. [26], Binh [27], Mortazy et al. [28], Zheng et al. [29], Lowery et al. [30], Sabella et 
al. [31], de Melo et al. [32], 

The design of the optical link involves several interrelated variables which describe 
operating characteristics of transmitter, fibre and receiver. The key practical requirements 
are: transmission distance, data rate and bit error rate. 

The simplest approach to the design of optical link is based on the evaluation of power 
budget and rise time budget [33]. Power budget analysis is needed to ensure that the system 
will work over the proposed distance with given components. The rise times of the source, 
fibre and the detector will determine the bandwidth available for transmission. 

In the tables we will summarize losses in typical optical components. Losses of 
a single-mode fibre from Coming Glass Works at various wavelengths are summa¬ 
rized in Table 14.1 [33]. Typical values of losses of other components are provided in 
Table 14.2. 













334 


Optical link 


Transmitter P-D 
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HU- 


A 

-D-q 



Distance from 
transmitter 


Illustration of power budget. 


14.2.1 Power budget analysis 


In the analysis of power budget one determines the total allowed optical power loss between 
the transmitter and the receiver. Typical components which contribute to losses are: fibre, 
connector, splices, couplers and splitters. Typical values of losses in standard components 
are summarized in Table 14.2. 

In Fig. 14.3 we show a typical power budget which illustrates loss of power over existing 
components as a function of distance from transmitter. In our example losses are created 
within connectors, splices and within an optical fibre. 

In calculating power budget one considers passive and active components of the link. 
Passive loss is made up of fibre loss, connectors, splices, couplers and splitters, among 
others. An example of a system power budget is illustrated in Table 14.3. 

In this example, one has an excess power margin of 29 dB - 28 dB = 1 dB (total 7 dB 
with the included system margin). As a general rule, the link loss margin should be greater 
than 3 dB to allow for link degradation over time, ageing of transmitters, etc. If during 
operation cables are accidentally cut, excess margin is needed to accommodate splices for 
restoration. 


14.2.2 Rise time budget 


The rise time budget is needed to determine whether the link will operate at the required 
bit rate. The bit rate is mostly limited by the dispersion. As explained earlier, dispersion 
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Table 14.3 Illustration of power budget determination. ! 

Transmitter 


Output power 

— 13 dBm 

Receiver 


Rx sensitivity 

-42 dBm 


Margin 

29 dBm 

System loss 


Fibre (3.5 x 5 km) 

17.5 dB 


Connector (1 dB x 2) 

2 dB 


Splicing (0.5 dB x 5) 

2.5 dB 


System margin 

6 dB 


Total 

28 dB 


is responsible for pulse broadening as it travels through the fibre. Those effects introduce 
rise time due to the material dispersion (x mat ) and modal dispersion (x mtxi ). Additionally, 
the response of the transmitter and the receiver must also be included. Therefore, total link 
rise time of the system is determined by the rms sum of all the rise times [33]: 


x sys = ^ + + + 4 


(14.1) 


For two main data coding schemes, namely RZ and NRZ, a general rule is that the system 
rise time should be determined as 


(14.2) 

(14.3) 


where B is the bit rate. The rise times of the optical transmitter and receiver are known 
from the manufacturer. The rise time for multimode step index fibre is determined as [33] 
(subscript im stands for intermodal) 


For material dispersion 



L 

c 


(14.4) 

x mat — 85Z.AAp.s- 

(*0~ 

850 nm) 


0.5Z,A2ps 

(*0~ 

1300 nm) 

(14.5) 

~ 20Z.A2.ps 

(Ao~ 

1500 nm) 



where L is in km and A a in nm. 
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Table 14.4 Illustration of r 

ise-time budget. 


Component 



Rise time (ns) 


System budget (NRZ) 




1.75 

Light source 



1.0 


Fibre 

Photodetector 

Transit time 

0.5 

0.23 



Total 

1.3 

1.4 


System rise time 



1.75 

1.75 


For single-mode fibres 


x f ~D L- AA 


~ 2 • Z • AA (A-o ~ 1300 mn, A z ~ 1300 mu) 
~ 16 • Z • AA (Ao ~ 1550 mn, A z ~ 1300 nm) 
~ 2 • Z ■ AA (A 0 ~ 1550 nm,A z ~ 1550 nm) 


(14.6) 


with Z> ~ 2 ps/km-nm at 1300 nm and ~ 16 ps/km-nm at 1550 nm with fibres with 
A z ~ 1300 nm. Also D ~ 2 ps/km-nm at 1550 nm for fibres with A z ~ 1550 nm. Here AA 
is the source spectral width, Z is fibre length, A 0 is the operating wavelength and A z is the 
zero dispersion wavelength. 

Typical rise time budget calculations are summarized in Table 14.4 [10]. 


14.3 Measures of link performance 


There are various requirements which determine the performance of optical networks. One 
of them is network security, which will not be discussed here. 

The simplest optical network is formed by a point-to-point optical link. Performance of 
such a link can be determined by the amount of information received at the acceptable level 
of errors [34], Loss of information can be caused by a number of factors, of which some 
typical ones are (a) decrease in the amplitude of the received signal and (b) loss of timing. 

Signal quality is determined by analysing the so-called eye diagram and also signal-to- 
noise ratio (SNR). The main parameter determining the quality of a digital link is the bit 
error rate (BER). The smaller the bit error rate, the better the link transmission quality. 
Typical error rates for optical fibre telecommunication systems range from 10 9 to 10“ 12 
[35], They depend on the SNR at the receiver. BER was discussed in Chapter 10 dealing 
with receivers. Here, we only discuss the eye diagram. 
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Bit pattern 


Oscilloscope 

generator 

Test system 

o 

Trigger Data out 


Vertical Trigger 
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| Transmitter -- Receiver 


Trigger line 

Schematic of an experimental equipment setup for making eye diagram measurements. 



14.3.1 Eye diagram 


The eye diagram technique is a simple and convenient method of diagnosing problems 
with data systems. The eye diagram is generated in the time domain using an oscilloscope 
connected to the demodulated data, see Fig. 14.4. Pseudorandom data are generated and 
applied to the test system. After being transmitted through the test system, the data are 
applied to the vertical input of an oscilloscope. The oscilloscope is triggered at every 
symbol period (here using data from a separate line) or fixed multiple of symbol periods. 
As a result, the signal is retraced and superimposed on the same plot. Construction of an 
eye diagram for a 3-bit long sequence of data is shown in Fig. 14.5. 

The result is an overlap of consecutive received symbols which form an eye pattern on the 
oscilloscope. The eye diagram provides a lot of information about the system’s performance 
or about performance of its elements. 

A created sequence of Gaussian pulses with random component for a predefined logical 
combination is shown in Fig. 14.6. For this sequence, the eye diagram is shown in Fig. 14.7. 
MATLAB code used in the generation of those results is shown in Appendix, Listing 14 A. 1. 
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Sequence of six Gaussian pulses with a random term. The logical data sequence is: 010101. 



all pulses. 


14.4 Optical fibre as a linear system 


In this section we consider the optical fibre modelled as a linear system and illustrate 
propagation of signals in such a system. 

When an optical pulse propagates through optical fibre, it exhibits attenuation, delay 
and distortion. Delay can be handled through appropriate synchronization. Attenuation 
of a particular fibre determines the ultimate distance between transmitter and receiver. 
Distortion causes pulse broadening known as dispersion; it can produce situations when 
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(a) 

Sout(t) 



A. 


(b) 

Illustration of pulse transmission in an optical fibre represented as a linear system described by the impulse response 
function hit). (a) Schematic of optical fibre and its representation as a linear system, (b) pulse broadening during 
propagation. 


one cannot distinguish between pulses representing different logical information due to 
their overlap. 

Attenuation and dispersion determine practical limits on the information transmission 
performance of optical communication systems. Additional distortions, including noise, 
are produced in the receiver. 

Propagation of optical signal of frequency co through fibre of length L having attenuation 
coefficient a is schematically illustrated in Fig. 14.8. In a first approximation, the optical 
fibre is considered as a linear system with an impulse response hit). Its Fourier transform 
H(co) known as a transfer function is 


»-£ 


h{t) e 


(14.7) 


The fibre is not an ideal system. Therefore during the passage, the properties of the trans¬ 
mitted pulse described by its time function s{t) or its spectral function S(oj>) are distorted, 
see Fig. 14.8. (Spectral function S(oj) and the time function s(t) are related by the FT 
relation.) 

In the frequency domain, the pulse propagation is described by the transfer function 
H(co) as 

Hico) = (14.8) 

S in (co) 

where a(oj) is the system’s attenuation and </>(«) its phase. S out (&>) and S in (&>) are FT of 
output and input time functions of a propagating pulse. 

Some of the examples of spectral functions Hico) which are used to model pulse spectra 
and filter functions will now be summarized. These functions are easy to program. All are 
dimensionless, normalized low-pass functions approaching unity at v = 0 and zero for 
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v —y oo (we use the relation w = 2nv, where w is angular frequency and v is frequency). 
They are real functions and symmetric with respect to v = 0. The functions are [36]: 

a) Gaussian pulse 

H g (v) = exp (-j rvV) (14.9) 


b) Rectangular pulse of duration r 

sin (jtvt) 

Hrp{v) = - 

nvr 

c) Raised cosine spectrum 

Hrc{V) = COS 2 ^ (! + COS7TVT) 

d) Spectrum of a trapezoidal pulse of duration r and rise time t r 


(14.10) 


e) Spectrum with Nyquist slope 


n(l-2vr)((l-|2vr|f-l)] 


where N is the parameter which controls roll-off of this function, 

f) Receiver filter that equalizes 


#e(v) 


1 + COS 7TUT 

2 exp (— nv 2 x 2 Q) 


(14.14) 


We will now summarize a description of the three main elements of an optical link: 
transmitter, fibre and receiver. 


14.5 Model of optical link based on filter functions 


In a typical practical situation the optical system transmits pulses. As they propagate from 
the transmitter to the receiver, they broaden and also their slopes get distorted. A presence 
of noise in all of a system’s components also adds to the modification of propagating pulse. 

We start with a simple analysis where the whole system is modelled by a single function. 
Here, we only consider a rectangular pulse. 

14.5.1 Test analysis for a rectangular pulse 


In this section we present the analysis of propagation of a rectangular pulse. Although not 
a very realistic situation, it serves as a test. 

In Fig. 14.9 on the left we show the input rectangular pulse. It propagates through the 
system modelled as a rectangular filter (shown on right). The output after propagation is 
shown in Fig. 14.10. MATLAB code in shown in the Appendix, Listing 14A.2. 
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ie (a.u.) 

Rectangular pulse (left) and rectangular filter. 


-100 -50 0 50 100 

Frequency (a.u.) 



Time (a.u.) 


Rectangular pulse after regeneration. 


Transmitter Fibre Receiver 



Main elements of the transmission system. 

Next, we build a more realistic model of the optical link, which is done in the following 
section. We summarize simple models for all the main components that constitute optical 
fibre system and then develop a program which allows for the analysis of pulse propagation 
in such a system. 

Our approach is based on the work by Geckeler [36], Main parts of transmission system 
are shown in Fig. 14.11. Here, E 0 is the energy of the optical pulse produced by the 
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Table 14.5 Parameters of typical generated pulses. j 

Parameter 

Symbol 

Value 

Bit rate 

Ro 

50 Mbits/s -1 

Power 

P 0 

0.5 mW 

Normalized pulse width 

To 

0.7 


transmitter, ro is the width of that pulse and Ho(co) is the spectral function of the produced 
optical pulse. Detailed discussion of all the three elements now follows. 

14.5.2 Transmitter 


It is usually a semiconductor laser which can be directly modulated by changing bias 
current or by employing an external modulator. The main characteristics of light source are: 
wavelength, output power, modulation speed, spectral width, noise. For a directly modulated 
semiconductor laser, the performance is determined by frequency chirping (discussed in 
Chapter 7). 

Here we assume a particular optical pulse as produced by a transmitter. Optical pulses 
were discussed in Chapter 6. Of particular importance are rectangular and Gaussian pulses. 

Pulse produced by the transmitter in the time domain is described by the function po(t ). 
Its Fourier spectrum is 



Fourier spectrum is related to spectral function H 0 (co ) as 
tSb(u>) = Po,max ■ fi) ' Hq{(o) 

Parameters of the generated pulse are provided in Table 14.5. They refer to a system with 
a bit rate Ro = 50 Mbits/s -1 and an LED transmitter operating at 1.3 pm wavelength with 
a maximum launched power of 0.5 mW. 

14.5.3 Fibre 


The frequency spectrum of propagating pulse after it leaves fibre is obtained by multiplying 
pulse spectral function and fibre spectrum. 

Pulse propagation in the fibre is described by the product of two functions: D\ which 
describes losses and its transfer function H x (w). Losses are represented by 

A = — = 10 - i®s (14.15) 

Eo 

The function H\ (co) is a normalized filter function approximately expressed as a Gaussian 
low-pass filter: 




(14.16) 








343 


Model of optical link based on filter functions 



The parameter n can be determined from the measured bandwidth Si of a fibre. Fibre 
parameters are listed in Table 14.6. 

The fibre parameter z\ is related to the bandwidth as 


Tl 


1 

2B[ 


(14.17) 


Bandwidth is modelled after Geckeler [36] as 



(14.18) 


where L c is the mode coupling length and Bi is the bandwidth-length product. The ap¬ 
proximation (14.18) is valid for 0 < Li < 3 L c . The parameters Bi and L c are empirical 
parameters. Their values are summarized in Table 14.6. 


14.5.4 Receiver 


An optical receiver is described by the low-pass filter transfer function 

( 

with the parameter Ti. 

The signal spectrum at the receiver output is determined as [4] 


H] 


at \co\ < — 


H 2 (co) = - 1+ cos 


(co) = if • T ■ M • R • G • Hq(co) • H\ (cu) ■ H2(co) 


(14.19) 


(14.20) 


where M is the multiplication factor of an avalanche photodiode (if used), G is its amplifi¬ 
cation factor and 


is the mean photocurrent of a pulse in a time slot of duration T. 
The parameter of the receiver is provided in Table 14.7. 


(14.21) 


14.5.5 Implementation of link model 


The above approach has been implemented following work by Geckeler [36] and [4], MAT- 
LAB code is shown in Appendix, Listing 14A.3. Typical results are shown in Fig. 14.12. 
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0.8 
0.6 
0.4 
0.2 
0 

-0.2 

-0.4-‘ 1 2 - 

0 2 4 6 8 

Frequency (a.u.) 

|0 Spectrum of rectangular pulse before (dotted line) and after propagation (solid line). 



The signal spectrum generated by the transmitter is sin(n vxq)/( n vtq), which represents 
rectangular pulse of duration Tq. Input pulse is shown as a dotted line, the output signal 
after the receiver as a solid line. More details were provided by Geckeler in Refs. [36] and 

[4]- 


14.6 Problems 


1. Analyse spectral functions summarized in Section 14.4. Plot the time dependence of 
each function. Evaluate Fourier transform and plot frequency spectra. 

2. Analyse Gaussian pulse using the filter approach. 

3. Analyse transmission of a Gaussian pulse by a single-mode fibre system. Determine 
the output pulse width as a function of the input pulse width. 

14.7 Projects 


1. Write MATLAB code to construct an eye diagram for the program link.m. 

2. Add noise component to program link.m. Analyse the effect of noise on a system’s 
performance. 
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Function name 


Description 


14A.1 

14A.2 

14A.3 


bits_gen._eye.rn 

rectangular.m 


link.m 


Generates pattern of Gaussian pulses and eye diagram 
Rectangular pulse before and after filtering 
Simulation of optical link (rectangular pulse) 


Appendix 14A: MATLAB listings 


In Table 14.8 we provide a list of MATLAB files created for Chapter 14 and a short 
description of each function. 

Listing 14A.1 Function bits gen eye.m Function generates a 6-bits long pattern of Gaus¬ 
sian pulses with random terms and eye diagram. 

7. File name: bits_gen_eye.m 
% Purpose: 

7. Generates 6-bits long pattern of Gaussian pulses with random terms 
7. Based on generated sequence, eye diagram is created 

7, 

clear all 

bits =[010101]; 7. def. of bit’s sequence 

T_period = 50d-9; 7« pulse period [s] 

t_zero = 20d-9; 7. center of incident pulse [s] 

width = 6d-9; 7. width of the incident pulse [s] 

7. Generation of current pattern corresponding to bit pattern 
I_P = 0; 

N_div = 50; 7. number of divisions within each bit interval 

t = linspace(0, T_period, N_div) ; 7. the same time interval is 

7. generated for each bit 

g = exp(-0.5*((t_zero - t) /width) . ~2) ; 7« def. of a single Gaussian pulse 
1 

if bits(l)==0, I_p_l = rand(l)*(l- g) ; 7. generates first bit 

elseif bits(l)==l, I_p_l = rand(l)*g; 

end 

temp = I_p_l; 

number_of_bits = length(bits); 

for k = 2 :number_of_bits 7. generation of remaining bits 

if bits(k)==l 

A = rand(l)*g; 

elseif bits(k)==0, A = rand(l)*(l- g); 


A = [temp,A] ; 
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for k = 2:number_of_bits 

t = linspaceCO, T_period, N_div); 
t = [temp_t,(k-l)*T_period+t]; 
temp_t = t; 


h = plot(t,A,’LineWidth’,1.5); 

xlabelC’time [s] ’,’FontSize’ , 14) ; °/ 0 size of x label 

set(gca,’FontSize’,14); '/, size of tick marks on both axes 

pause 
close all 

y. 

y,=============== Eye diagram ============================ 

'/, put all bit plots on the first one 
t_eye = linspaceCO, T_period, N_div); 
hold on 

for m = 1:number_of_bits 

A_temp = A((l+(m-l)*N_div):(m*N_div)); 
e = plot(t_eye,A_temp); 

set(e, ’LineWidth’ ,1.5) ; "/, thickness of plotting lines 

set(gca,’FontSize’, 14) ; '/, size of tick marks on both axes 

close all 


Listing 14A.2 Function rectangular.m. Function which shows a rectangular pulse before 
and after filtering. 

'/, File name: rectangular.m 

°/ 0 Analysis of rectangular pulse after filtering 

l 

clear all 

hwidth = 10; "/. half-width of a pulse 

time_step = 0.001; °/„ time step 

range = 40; 

t = -range:time_step:range; °/ time range 

°/ 0 -Creation of rectangular pulse- 

y = (5/2)*(sign(t+hwidth)-sign(t-hwidth)); 
plot(t,y,’LineWidth’,1.5); 
xlabelC’time (a.u’,’FontSize’,14); 
ylabelC’Pulse amplitude’,’FontSize’,14); 
set (gca,’FontSize’, 14) ; °/„ size 

axis([-range range 06]); 


of tick marks 


both 
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7, % -Fourier transform of a rectangular pulse- 

y_shift = fftshift(fft(y)); 

N=length(y_shift); 
n=-(N-l)/2:(N-l)/2; 

l - Definitions of different filters - 

filter = (l/2)*(sign(n+100)-sign(n-100)); 

7.filter = exp(-pi*n*0.0001); 

7.filter = 0. l*cos(pi*n*0.005) . “2; 

7.f ilter = (1 + cos(0.05*n))/2; 
plot(n,filter,’LineWidth’,1.5); 
axis([-3*range 3*range 0 1.5]); 
xlabelCFrequency (a.u.)’,’FontSize’,14); 
ylabel(’Amplitude’,’FontSize’,14); 

set(gca,’FontSize’,14); 7, size of tick marks on both axes 

7,-Regeneration of signal using filtered output- 

y_reg=filter.*y_shift; 7, Filter original frequency spectrum 

y_reg_inv=ifft(y_reg) ; 7, Inverse Fourier transform 

plot (t,abs(y_reg_inv),’LineWidth’,1.3); 

axis([-range range 06]); 

xlabelCTime(a.u.)’,’FontSize’,14); 

ylabel(’amplitude’,’FontSize’,14); 

set (gca,’FontSize’, 14) ; 7, size of tick marks on both axes 

close all 


Listing 14A.3 Function link.m. Function models propagation of a rectangular pulse in 
an optical link. 

7, File name: link.m 
i Optical link simulations 

7, Follow code on p.133 of S. Geckeler (1983) paper 
7, No model of a transmitter 
7, No noise 
1 

clear all 

7.-Pulse produced by transmitter- 

R_0 = 50d5; 7. Bit rate of binary digital signal, 50 Mbit/s 

P_0 = 5d-4; 7, Max power of optical pulse, 0.5 mW 

T_0 = 0.7; 7. Normalized pulse duration = tau_0/T 

7.-Fibre parameters- 

L_1 = 15; 7. Fibre length, 15 km 

A_L = 3.0; 7. Attenuation coefficient, 3 dB/km 

D_1 = 10“ (-A_L*L_1/10) ; 7. Power ratio 

P_1 = P_0*T_0*D_l/2; 7. Mean optical power at 


the end of fibre 
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B_L = 5d8; "/. Bandwidth-length product, 500 MHz km 

L_C = 10; '/, Coupling length of a multimode fibre, 10 km 

B_1 = B_L/L_1 + B_L/(3*L_C); 7, Fibre bandwidth 

T_1 = R_0/(2*B_1); 

% - Receiver parameters - 

T_2 = 0.7; 

'/,============== System evaluation ====================================== 

'/, Output signal spectrum 


D_F = pi/(20*T_2); 


f = zeros(l,N); 
f_A = zeros(1,N); 
f_B = zeros(l,N); 
f_C = zeros(l,N); 

H_F = zeros(1,N); 

H_F(1) = 1.0; 
for j = 1:N 

f(j) = j*D_F; 

f_A(j) = sin(f (j)*T_0)/(f ( j)*T_0) ; 
f_B(j) = exp(-f(j)~2*T_l*T_l/pi); 
f_C(j) = (1/2)*(1 + cos(f (j)*T_2)) ; 
H_F(j) = f_A(j)*f_B(j)*f_C(j) ; 


7, transmitter 
"/, fibre 
7, receiver 


7 

plot(f ,f_A, ’ . ’ ,f ,H_F, ’LineWidth’ ,1.5) */, Plot signal after propagation 

xlabel(’Frequency (a.u.)’,’FontSize’,14); 

set(gca,’FontSize’,14); °/ 0 size of tick marks on both sixes 

pause 

s = ifft(H_F); 
s = s/max(s); 
ff = linspace(0,2/3,N); 


plot(fftshift(abs(s)),’LineWidth’,1.5); 

xlabel(’time’,’FontSize’,14); 

ylabelC’time output function’,’FontSize’,14); 

set(gca, ’FontSize’ ,14) ; °/ 0 size of tick marks on both axes 

grid 

pause 

close all 
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Optical solitons 




In this chapter we concentrate on optical solitons and their propagation in optical fibre. 
They are pulses of a special shape and owe their existence due to the presence of dispersion 
and nonlinearity in optical fibre. They are able to propagate ultra-long distances while 
maintaining their shape. 

Here, we provide some basic knowledge for understanding the underlying physical 
principles of soliton creation and propagation. We generalize the theory of linear pulses 
developed in Chapter 5 and include the nonlinear part of polarization. We will then derive 
a nonlinear equation which describes propagation of solitons in optical fibre. 

Soliton propagation in an optical fibre was first demonstrated at Bell Laboratories by 
L. F. Mollenauer, R. S. Stolen and J. P. Gordon. We refer to the book by Mollenauer and 
Gordon [1] for the description of basic principles and a brief history of solitons. 

Some other relevant books on optical solitons and also on applications in optical com¬ 
munications are [2], [3], [4], [5], [6], 


15.1 Nonlinear optical susceptibility 


Solitons exist due to nonlinearity and dispersion. Dispersion was discussed earlier in the 
book. Here, we will concentrate on nonlinear effects. Optical responses including nonlinear 
effects are described as [7] 

Pit) = e 0 {x * (1) £(0 + x (2) £ 2 (0 + x Q) E\t) + • • •} (15.1) 

= P m (t) + P (2) (t) + P 0) it)+ ■■■ 

where we have expressed polarization P(t ) as a power series in the field strength E (t ). The 
quantities x (l \ x (2 \ x <3) are known as susceptibilities; x (l) is a linear susceptibility and 
X <2 \ x (3) are known as the second order- and third-order nonlinear susceptibilities. 

For a typical solid-state system x (1) is of the order of unity whereas x <2) is of the order of 

1 /E at and x (3) of the order of 1 /E 2 t , where E at = e/(4jteoal) is the characteristic atomic 

electric field strength and ao — 4ir eoti 2 /me 2 is the Bohr radius of the hydrogen atom. 
Explicitly [7] 
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X (2) ~ 1.94 x 10 ~ 12 m/V 
X (3) ~ 3.78 x 10 _24 ot 2 /F : 
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Formal expression for the third-order susceptibility is [7] 

P,(oJo + (On + (D m ) = 6q D Y, Xitel (o>0 + (O n + (O m , COq, 0)„, CO m ) 
jkl 

x Ej((Oo) E k (oj„) Ei(oj m ) 

where i, j, k, l refer to the Cartesian components of the fields and the degeneracy factor D 
represents the number of distinct permutations of the frequencies wq, co„, w m . 

y^ (j = 1,2,...) is the j-th order susceptibility. The linear susceptibility y (l) con¬ 
tributes to the linear refractive index no (real and imaginary parts; the imaginary part 
being responsible for attenuation). The second-order susceptibility y (2) is responsible for 
the second harmonic generation. For SiO2 the second-order nonlinear effect is negligible 
since Si02 has the inversion symmetry. Therefore optical fibres normally do not show the 
second-order nonlinear effects. 

The third-order susceptibility y (3) is responsible for the lowest order nonlinear effects 
in optical fibres. Generally, it manifests itself as the change in the refractive index with 
optical power or as a scattering phenomenon. It is linked with the optical Kerr effect, four- 
wave mixing, third-harmonic generation, stimulated Raman scattering, etc. An excellent 
discussion of third-order optical susceptibilities has been reported by Hellwarth [8]. 

Assuming linear polarization of propagating light and neglecting tensorial character of 
yjy, one finds the following relation for the nonlinear polarization: 

P NL (a>) = 3e 0 y (3) (co = co + co - co) \E(co)\ 2 E(co) 

Total polarization, which consists of linear and nonlinear parts, is written as 
P(fo) = e 0 X m E(co) + 3eoX 0) \E(co)\ 2 E(co) 

= ^0 XeffE((0) 

The effective susceptibility is field dependent as 

y <# = y (1) + 3y®|£(o>)| 2 
and it is linked to refractive index as 

n-rfl + y® =n 0 + n 2 I (15.2) 

Here / denotes the time-averaged intensity of the optical field. We start with the discussion 
of the main features of nonlinear effects. 


15.2 Main nonlinear effects 


15.2.1 Kerr effect 


It was discovered by J. Kerr in 1875. He found that a transparent liquid becomes dou¬ 
bly refracting (birefringent) when placed in a strong electric field. Generally, Kerr effect 
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(b) 


E(z,oo) 


Aco 



fflo 


Illustration of the propagating modulated pulse (a) and its spectrum (b). 


describes situations where refractive index depends on electric field as 
n ( w, \E \ 2 ) = n 0 (co) + n 2 ( co)E\ 2 


Here, n 2 is known as Kerr coefficient and it is related to susceptibility as 


for a linearly polarized wave in the x direction. For silica its value is approximately 1.3 x 
10 -22 m 2 /V 2 . Kerr effect originates from the non-harmonic motion of electrons bound in 
molecules. Consequently, it is fast effect, the response time of the order of 10 -15 s. 


15.2.2 Stimulated Raman scattering 


Scattering phenomena are responsible for Raman and Brillouin effects. During those 
scatterings, the energy of the optical field is transferred to local phonons: in Raman 
scattering optical phonons are generated whereas in the Brillouin scattering the acoustic 
phonons. 


15.3 Derivation of the nonlinear Schrodinger equation 


Solitons in optical fibres are described by the so-called nonlinear Schrodinger (NSE) 
equation, which will be now derived. In the derivation we use the concept of the Fourier 
spectrum of the propagating pulse, see Fig. 15.1. 

A medium where solitons propagate exhibits Kerr nonlinearity. In such a medium, 
refractive index depends on intensity of electric field 7(f) given by Eq. (15.2), again 
reproduced here: 


n(t) = n 0 + «2 • 7(0 


(15.3) 
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where [7] 


1(f) = 2n 0 e 0 c\A(z,t)\ 2 


(15.4) 


Here A (z, t) is the slowly varying envelope connected to the optical pulse described by the 
optical pulse E(z, t ) as, see Fig. 15.1: 


E(z, t ) = A(z, t) 


(15.5) 


Fourier transform of the optical field is [3], [9] 



(15.6) 


where E (z, to) is the Fourier spectrum of the pulse, ft propagation constant and &> 0 the 
frequency at which the pulse spectrum is centered (also known as carrier frequency), see 
Fig. 15.1. 

For quasi-monochromatic pulses with Aa> = &> — «o <SC 6>o, it is useful to expand 
propagation constant fi (co) in a Taylor series: 



(15.7) 


where we have neglected higher order derivatives. Here A0 NL = n 2 koI is the nonlinear 
contribution to the propagation constant. 

Substitute expansion (15.7) into Eq. (15.6): 



x exp yitAu) — ifi\zAco — -ifczAco 2 — i-z■ Afi NL 
= gdwof-Az) A(z,t) 


where we have introduced 



(15.8) 


Our next step is to obtain a differential equation describing evolution of the amplitude 
A(z, t ) from Eq. (15.8) which is in the integral form. To do this, one needs to take partial 
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derivatives of Eq. (15.8). One obtains 
3 A(z, t ) 


- ’ ^ — [ d (Aw) E (z, co 0 + A co) iAco e ,g( - z '‘ 

at J- oo 


d 2 A(z, t) 
dt 2 


-s: 


d (Aft>) £ (z, &>o + A&)) (r Am) 2 


9A ^ ^ = J d(Aco) E(z, co 0 + Aw) Aco - ^r&Aft) 2 - i ■ Afi NL ^ e !g(z ’ 0 

When evaluating time derivatives, we have assumed in the above that Hi) does not depend 
on time. Addition of the above combination of derivatives produces 
3 A(z,t) dA(z, t) 1 3 2 A(z,t) 

— J d( Aft)) E(z, coo + Aw) 

x Aft) - Up 2 Aaj 2 - i ■ A/3, v/ ,j + fipAco - /^ 2 (/Aft)) 2 j x e‘ 8(z ’ n 


The term in the bracket is, [• • • ] = —i • A/3 ; y/, = — in 2 koI. Eq. (15.9) thus gives 
3 A(z,t) , . 3 A(z,t) .1 3 2 A(z,t) 

3 z +/31 3t *2^ 2 3/ 2 

d{Aco) E(z, ft) 0 + A®) (—m 2 koI) e' 8<z ' t> 


= —in 2 koI J d{A(d)E{z,co)e ig(: ' t) 


m —in 2 koI A{z, t ) 

when using (15.8). The final equation describing solitons is therefore [3], [10] 
3 A(z,t) dA(z,t) 1 3 2 A(z,t) 7 a 

44 + ^44 + * 2*^4 = * 01 ^ ° - 2^’ 0 

where we have defined nonlinear coefficient y (after [3]) as 
2jtn 2 
kA e ff 


y = \ 


(15.11) 


Here A e g is the effective core area. 

Our interests here lie in the pulse evolution during propagation and not in the time of pulse 
arrival. We can therefore simplify the above equation by transforming it to a coordinate 
system which moves with group v g . In this moving frame, new time T and new coordinate 
Z are 


Z = z 
T = t-piz 


(15.12) 
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To obtain the transformed equation, we must evaluate derivatives with respect to new 
variables as follows: 

dA _ dA dT dA dZ _ dA 
~dt ~ dT~dt + d\Z~dt ~ dT 

since = I and = 0. From the above one finds 

d 2 A _ d 2 A 
~dt 2 ~df 2 

Using the above results, one has 

dA _ dA dT dA dZ _ dA dA 
~dz ~ dT~dz + d\Z~dz ~~^dT + d\Z 

The last result is used in Eq. (15.10) to replace The transformed equation is 


~^ +i \^ 2 \fi~ i y\ A \ 2A + \ 0lA = Q (15.13) 

where in the final step we replaced Z by z. It is known as a nonlinear Schrodinger equation 
(NSE). 

To further analyse NSE, we will introduce two characteristic lengths describing dispersion 
(L/j) and nonlinearity (L nl ). Those are defined as 


L _ T o _ T o 

D m \d\x 2 


(15.14) 


and 


(15.15) 


where Pq is the peak power of the slowly varying envelope A(z,T ) and To is a temporal 
characteristic value of the initial pulse, which is often defined as full width half maximum 
(the pulse 3 dB width). Those two lengths characterize how far a pulse must propagate to 
show the respective effect. Physically, L D is the propagation length at which a Gaussian 
pulse broadens by a factor of V(2) due to group velocity dispersion (GVD). 

GVD dominates pulse propagation in fibres whose length L is L Ly/. and L > L D . 
In such situation, the nonlinearity in NLSE can be ignored and the equation can be solved 
analytically. Nonlinear effects dominate in fibre where L L D and L > L nl . In this limit 
the dispersion term can be ignored. 

Typical values of parameters used in the simulations are summarized in Table 15.1. 

For numerical analysis normalize variables 


u =wA “ d ’=% (15 - 16) 

The width parameter T 0 is related to the full-width at half-maximum (FWHM) intensity of 
the input pulse. Specifically 


T s = 2T 0 ln{\ + 7(2)) « 1.763T 0 


(15.17) 
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1 Table 15.1 Typical parameters used in simulation of solitons. | 

Parameter 

Symbol 

Value 

Unit 

wavelength 

X 

1.55 

qm 

nonlinear coeff. 

Y 

1.3 

1/kmW 

GVD 

h 

15 x 10“ 24 

s 2 /km 

width parameter 

To 

100 

ps 

peak power 

Po 

0.15 

mW 

losses 

a 

0.01 

dB/km" 1 

chirp parameter 

c 

1.2 

dimensionless 

soliton period 

Zo 

1047.2 

km 


After simple algebra, Eq. (15.13) takes the form 
dU .signal) d 2 U 


1 


(15.18) 


dz 2 L d 

Another normalized form of the Schrodinger equation exists in the literature. We obtain it 
in the lossless case, i.e. with a — 0. To derive it, normalize the z coordinate as follows: 


r + iN 2 \U\ 2 U = 0 


After a few algebraic steps, one obtains 

dU signal) d 2 U 

a* 

where N is known as soliton order and is defined as 

Lp _ yP 0 T 2 
Lnl \fh\ 

The last popular form of the NLSE is found by introducing u a: 

Eq. (15.20) then takes the form 

3 u sign(fh) d 2 u 


N 2 = - 


= NU = 


dr 2 


- i\u\ 2 u = 0 


(15.19) 

(15.20) 

(15.21) 

(15.22) 

(15.23) 


15.4 Split-step Fourier method 


In this section we will discuss the numerical solution of the nonlinear Schrodinger equation 
(NSE) which describes propagation of optical solitons using the so-called split-step Fourier 
method (SSFM). 
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(a) 


Aj A 2 A 3 A, A 5 A« A 7 Ag A, A 10 


Ai A 12 


nonlinear 



hi 2 

A i 



Illustration of split-step Fourier method, (a) Division of optical fibre into A regions (here A =11) of equal lengths, 
(b) Illustration of operation of linear and nonlinear operations at arbitrary segments. 


The SSFM is a numerical technique used to solve nonlinear partial differential equations 
like the NSE. The method relies on computing the solution in small steps and on taking into 
account the linear and nonlinear steps separately. The linear step (dispersion) can be made 
in either frequency or time domain, while the nonlinear step is made in the time domain. 
The method is widely used for studying nonlinear pulse propagation in optical fibres. For 
more detail see [11] 

A nonlinear Schrodinger equation, Eq. (15.13) contains dispersive and nonlinear terms. 
To introduce SSFTM, write the NLSE equation in the following form: 


M|£) = (E +S) ^, n 


where 


LA = 


a 

2 


d 2 A 

df 2 


contains losses and dispersion in the linear medium and nonlinear term 


(15.24) 


(15.25) 


NA = iy\A\ 


2 A 


(15.26) 


accounts for the nonlinear effects in the medium. 

The basis of the SSFM is to split a propagation from z to z + h (h is a small step) into 
two operations (assuming that they act independently): during first step nonlinear effects 
are included and in the second step one accounts for linear effects, see Fig. 15.2. 
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Formal solution of Eq. (15.24) over a small step h is thus 

A(z + h, t) = t ) (15.27) 


In the first-order approximation, the above formula can be written as 

A(z + h,t) = efi<FA{z, t) + 0{h 2 ) (15.28) 


The basis of this approximation is established by Baker-Hausdorf lemma [12], which is 

//=/+* elMl (15.29) 


given that operators A and B commute with [A, B], 

The basis of the method is suggested by Eq. (15.28). It tells us that A(z + h, t) can be 
determined by applying the two operators independently. The propagation from ztoz + h 
is split into two operations: first the nonlinear step and then the linear step assuming that 
they act independently. If h is sufficiently small, Eq. (15.28) gives good results. 

The value of step h can be determined by assuming that the maximum phase shift 
</>max = y\A p \ 2 h, where A p is the peak value of A (z, t) due to the nonlinear operator is 
smaller than the predefined value. Iannone et al. [2] reported that 0 max < 0.05 rad. 

For a practical implementation of the SSFM, we need to establish practical expressions 
for dispersive and nonlinear terms. In the following we will therefore analyse the effect of 
both terms independently neglecting losses. 

Let us analyse the effect of the dispersive term alone. For that we temporally switch off 
the nonlinear term. After Fourier transform, the ‘linear equation’ becomes 


dA(z, co) 
dz 


2 


l fi 2 A(z, co) 


which has the solution 

A(z, co) = 1(0, co)e- io?hzl2 


The action of the nonlinear term alone is described by the equation 
8A ^ = iy\A(z,t)\ 2 A(z,t) 

The ‘naturaF solution is in the time domain. It produces 
A(z, t) = A(0, t)e iylAl2A 


15.4.1 Split-step Fourier transform method 


The propagation medium (say, cylindrical optical fibre) is divided into small segments, each 
of length h, see Fig. 15.2. Further, each individual segment of length h is subdivided into 
two of equal lengths. The linear operator operates over each subsegment in the frequency 
domain, whereas the nonlinear operator operates only locally at the central point. 
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Operation of the linear operator L, Eq. (15.25) over first subsegment is done as follows: 

^ t/2 A(z, t) = F~ x {e“ /2 F { A[z , t))} (15.30) 

i.e. one must Fourier transform original amplitude from time domain into frequency domain, 
apply linear operator L and then apply inverse Fourier transform to get the amplitude back 
to time domain. 

The operation of nonlinear operator defined by Eq. (15.26) is as follows: 

0 = 4+1/2 t) e h * (15.31) 


where A i+ yij, is the value of field amplitude at an infinitesimal point left from i+\. Finally, 
the operation of linear operator over second subsegment of length | is done exactly the 
same way as over the first segment. 

To summarize, the method over each segment of length h consists of three steps: 


step 1 


step 2 


step 3 


A i (z,co)=F{A i (z, 0} 

2,_(z, co) = A,(z, co) ■ exp {-i\oo 2 p 2 h) 
A i _(z,t)=F~ x {A i _( Z ,co)} 

A i+ (z, t) = Ai_ (z, t) • exp (iy |^f 
A i+1 (z, co) = F [A i+ i(z,t)} 

A i+l (z, co) = A i+ 1 (z, co) ■ exp (-i\co 2 p 2 h) 
,Ai+i (z,t) = F~ x [A i+ {z, co)} 


where F indicates Fourier transform (FT) and F 1 inverse FT. 


15.4.2 Symmetrized split-step Fourier transform method (SSSFM) 


The simulation time of Eq. (15.28) essentially depends on the size of step h. To reduce 
simulation time, a new method was invented which allows us to use larger steps, h. 
Mathematically, one uses the following second-order approximation: 

A(z + h, t) = e^ r exp + iV(z , )c/z , J e'^ ht A(z, t) + OQi 3 ) (15.32) 

In this approach one assumes that the nonlinearities are distributed over h, which is more 
realistic. For small h, one can approximately evaluate 

/ z + h ^ h ^ ^ 

N(zf)dz! - \N(z) + N(z + h)\ 

In the SSSFM algorithm Eq. (15.31) is replaced by 

A i+ 1/2,x(z, t) = A i+ 1AJ? (z, t) (15.33) 

The above scheme requires iteration to find A i+ \ since it is not known at z + ^h. Initially 
N(z + h) will be assumed to be the same as N(z). 
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Evolution in time over one soliton period for N = lsoliton. 



Evolution in time over one soliton period for N = 1 soliton with damping. 

15.5 Numerical results 


Based on the developed software, in this section we will report on numerical analysis of 
soliton propagation in optical fibre. The SSFM has been implemented and MATLAB code 
is provided in Appendix, Listing 1 5A. 1. We start with the single solitons. 

15.5.1 Single solitons 


In our analysis we used fibre and pulse parameters summarized in Table 15.1. The input 
soliton was considered to be in the form 


A in (£ = 0 ,T) = VAo seek 


© 


(15.34) 


where N is the soliton order given by Eq. ( 15.21 ). In Fig. 15.3 we illustrated the propagation 
of N = 1 soliton without damping and in Fig. 15.4 the evolution of the same soliton with 
damping (a = 0.01 dBkm -1 ). 
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A higher-order soliton with N = 3 is defined as 

A in (f = 0 ,T) = N 3 VAo seek f ^ (15.35) 

Propagation of higher-order soliton with N = 3 is shown in Fig. 15.5 (no damping) and in 
Fig. 15.6 (with damping). One can observe periodic evolution of the undamped soliton. 

15.5.2 Chirped solitons 


Here we analyse how chirp affects single soliton propagation. We assume the following 
input: 

Am (£ = 0, T) = ^/A^ sech ex P 



(15.36) 
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Evolution in time ofa chirped soliton for N = 1 without damping. 



Evolution in time of two interacting solitons without damping. 


where C is the chirped parameter. Evolution of such soliton in the case of N — 1 and 
C = 1.6 is shown in Fig. 15.7. The pulse is initially compressed and then broadens. 

15.5.3 Two interacting solitons 


The effect of nonlinearity produces mutual interaction between soliton pulses if they are 
launched close together [13]. This interaction is important from a practical point of view 
and also from a fundamental perspective related to soliton propagation. For example, it was 
shown [13] that nonlinear interaction between solitons can result in a bandwidth reduction 
by a factor of 10. See also the review in [14], 

First, we consider an interaction between two solitons of the same strength and N = 1. 
We assume the initial pulses of the form [15] 

A in (f = 0, T) = jAosech - 1^ + jA^sech ^ + 1^ (15.37) 

The result of two interacting solitons is shown in Fig. 15.8. 
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A I I A 


/\ 

Ail 


I Stream of solitons. 


15.6 A few comments about soliton-based communications 


As mentioned at the beginning of this chapter, an important application of solitons is in 
the transmission of information in optical fibre systems. Soliton pulses are stable when 
they propagate over long distances. Losses in fibres are an important limiting factor, so it 
becomes necessary to compensate periodically for fibre loss. This can be done by using 
EDFA. 

In the transmission of information, solitons are considered like linear pulses. In a bit 
stream each soliton has its own bit slot and it represents logical one or zero, see Fig. 15.9. 
Here Tb is the duration of the bit slot and it is related to bit rate B as 

Tb= X - = 2aT 0 (15.38) 

n 

where T 0 is the soliton width and 2 a specifies the distance between neighbouring solitons. 
To prevent their interactions, the neighbouring solitons should be well separated. 

Solitons are very well suited for long-haul communication because of their high 
information carrying capacity and the possibility of periodic amplification. However, 
soliton systems are still waiting for the full field deployment. The reader might con¬ 
sult recent publications on soliton-based optical communication systems [16], [17], 
[18], [19], 


15.7 Problems 


1. Implement a symmetrized SSFM based on Eq. (15.33). 

2. Generalize the implementation by including the third-order dispersion effect where 
ft :/ 0. Analyse the influence of ft on propagation of N = 1 and N = 3 
solitons. 

3. Implement SSFM for a normalized form of NLSE given by Eq. (15.23). Using your 
implementation, conduct the analysis for N = 1 and N = 3 solitons. 

4. Analyse the propagation of Gaussian pulses in fibres with nonlinearity using SSFM. 
Consider regular Gaussian and also chirped Gaussian pulses. 
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Appendix 15 A: MATLAB listings 


In Table 15.2 we provide a list of MATLAB files created for Chapter 15 and a short 
description of each function. 

Listing 15B.1 Program ssftw full.m. Here we conduct analysis of various solitons using 
the split-step Fourier transform method. 

% File name: ssftm_full.m 

% Split-step Fourier transfer method for the analysis of solitons 

7. Contains the following initial pulses 

7. N=1 soliton 

7. N=3 solitons 

% Chirp N=1 soliton 

l Two interacting solitons 

l For the analysis of the appropriate case it should be uncommented 

7 . 

clear all 


"/,===== Material parameters == 

7.alpha=0.01; 

alpha = 0.0; 

alpha=alpha/(4.343); 

gamma = 1.3; 

beta_2=15d-24; 

'/,====== Time parameters ===== 

T =- 512e-12:le-12: 511e-12; 
delta_t=le-12; 

"/,======= Input pulse and its 

T_0 = 100d-12; 

P_0 = 0.15d-3; 

A_0 = sqrt(P_0); 

L_D =((T_0~2)/(abs(beta_2))): 
L_NL = (1/(gamma*P_0)); 
h = 2d0; 

z_plot = (0:h:((pi/2)*L_D)); 
n = 1; 

z_0 = (pi/2)*L_D; 

N = 3; 


% Fibre loss (dB/km) 

7, Fibre loss (1/km) 

7, Fibre nonlinearity [1/W/km] 
7. 2nd order disp. (s~2/km) 


parameters ======== 

7. Initial pulse width (s) 
7. Input power (Watts) 


t Dispersion length i 


[km] 


l step size [km] 

7, z-values to plot [in km] 
7. controls stepping 
7. Soliton period [km] 

7. Soliton number 
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7.A = N~2*A_0*sech(T/T_0); 7. N=3 soliton 

7.A = A_0*secli(T/T_0) ; 7. N=1 soliton 

C = 1.6; 7. chirp parameter 

7.A = A_0*sech(T/T_0) .*exp((-li*C/2) .*(T/T_0) ."'2) ; 7. N=1 soliton with chirp 

A = A_0*sech(T/T_0-l.6)+A_0*sech(T/T_0+l.6) ; 7, interacting solitons 

7. 

L = max(size(A)); 

Delta_omega=l/L/delta_t*2*pi; 
omega = (-L/2:1:L/2-l)*Delta_omega; 

A_f = fftshift(fft(A)); 7, Pulse in frequency domain 

for jj=h:h:z_0 

A_f = A_f.*exp(-alpha*(h/2)-li*beta_2/2*omega.~2*(h/2)) ; 

A_t =ifft(A_f); / Pulse in time domain 

A_t = A_t.*exp(li*gamma*((abs(A_t)).“2)*(h)); 

A_f = fft(A_t); 

A_f = A_f.*exp(-alpha*(h/2)-li*beta_2/2*omega.~2*(h/2)); 

A_t = ifft(A_f); 
plottingC:,n) = abs(A_t); 
n = n+1; 

cputime=toc; 

7. 

z_plot = h:h:z_0; 7. creates distance for plotting 

length(z_plot) 

for k = 1:80: length(z_plot) 7. Choosing 3D plots every 10-th step 
y = z_plot(k)*ones(size(T ))\°l Spread out along y-axis 
plot3(T*ldl2,y,plottingC:,k),’LineWidth’,1.5) 
hold on 

end 

xlabelC’time (ps)’,’FontSize’,14); 
ylabel(’distance (km) ’,’ FontSize ’, 14); 

set(gca, ’FontSize’ , 14); 7. size of tick marks on both axes 

grid on 
pause 
close all 
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Solar cells 




In this chapter we provide the fundamentals of solar cell operations. We outline basic 
physical principles, summarize a model based on equivalent circuit and discuss ways to 
increase efficiency of single-junction solar cell by employing multijunction structures or 
by creating intermediate bands. 

General introductory books describing solar cells are [1], [2] and [3], Anderson [1] and 
Rabl [2] provide a practical background in solar-energy conversion including techniques 
for estimating the solar radiation incident upon a collector, fundamentals of optics for solar 
collectors, discussion of types of concentrators and economical analysis, among others. 
Moeller [3] concentrates on fundamental, physical and material aspects of semiconductors 
for photovoltaic energy conversion. 


16.1 Introduction 


World consumption of electric energy circa 2009 was around 12-13 TW [4], This energy 
was created by several methods. Let us look briefly at two of them: nuclear power and solar 
energy. 

Assuming that a single nuclear plant produces 1 GW of power, creation of 10 TW of 
power would require 10 000 nuclear power plants. This huge number will certainly create 
social problems. Also, the uranium needed for those plants could be diminished in less than 
20 years. 

As far as the solar energy is concerned, the Sun supplies more solar energy on Earth 
in one hour than we use globally in one year. To make another comparison, a football 
field covered with silicon solar cells with an efficiency around 17% at 1-Sun illumination 
produces approximately 500 kW of power. The USA uses about 3 TW, so to produce that 
amount one will need about 6 million football fields. However, by increasing efficiency 
of solar cells, for example by using multijunction devices and increasing concentration of 
solar energy, the area of semiconductor solar cells can be significantly reduced [5]. 

Those rough estimates indicate that solar energy can be a practical alternative as a source 
of electric energy, and also that increasing the efficiency of solar cells is important. One 
should, however, remember that the incidence of solar energy is not uniform. 

Solar irradiation spectrum (spectrum of solar energy) is shown in Fig. 16.1. This graph 
will be important later when we discuss in detail multijunction cells. In the figure we 
show spectral intensity, that is intensity of solar energy per unit wavelength. Above the 
Earth’s atmosphere it is denoted as AMO (air-mass zero). The integrated (over the whole 
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Solar irradiation spectrum. Reprinted and adapted with permission from C. H. Henry, J. Appl. Phys. 51, 4494 (1980). 
Copyright 1980, American Institute of Physics. 



Definition of air-mass. 


spectrum) intensity above the Earth’s atmosphere is constant and its approximate value is 
1.353 kW m 2 . It gives the total power flow through a emit area perpendicular to the direction 
of the Sun. 

The Earth’s atmosphere significantly absorbs radiation from the Sun. In Fig. 16.1 the 
radiation at the Earth’s surface is labelled as AM 1.5. In the figure several absorption bands 
and lines due to various effects are shown. For comparison, the spectrum of black body 
radiation at the temperature of 6000 K is also shown. One can notice that the solar spectrum 
above the Earth’s atmosphere (AMO) resembles black body radiation. 

The air mass coefficient defines the direct optical path length through the Earth’s at¬ 
mosphere, expressed as a ratio relative to the path length vertically upwards, i.e. at the 
zenith. 

Let us finish this section with an explanation of AM (air-mass) convention. Generally air 
mass m (AMm) is defined (see Kasap [6]) as the ratio of the actual radiation path h to the 
shortest path ho, m = h/ho = cos ©, see Fig. 16.2. 

Changing the light intensity incident on a solar cell changes all solar cell parameters, 
including the short-circuit current, the open-circuit voltage, the efficiency and the impact 
of series and shunt resistances. The light intensity on a solar cell is measured in the number 
of ‘ srrns’, where 1 sun corresponds to standard illumination at AM 1.5, or 1 kWm 2 . For 
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V 


n-type 


p-type 


A p-n junction operating as a solar cell. 


1 

T 


example, a system with 10 kW m 2 incident on the solar cell would be operating at 10 suns, 
or at 10X. 


16.2 Principles of photovoltaics 


The simplest solar cell (or photovoltaic (PV) cell) is a single p-i-n junction operating 
under forward bias. It is formed by using, e.g. properly doped silicon, see Fig. 16.3. It can 
produce potential difference, thus acting as a battery. It contains a very narrow and heavily 
doped n-region. It is covered with thin antireflecting coating to reduce sun reflection which 
penetrates p-n junction from n-side, see Fig. 16.4 for more details. 

Short wavelengths penetrate only the n-region, longer wavelengths penetrate depletion 
region of width w, and finally the longest wavelengths reach the p-region, see Fig. 16.5. 

Photons of frequency v having energies hv > E g , where E g is the bandgap energy, 
generate electron-hole (e-h) pairs. Due to a built-in electric field Eq in the depletion region, 
electrons move towards metal contact in the n-region whereas holes travel in opposite 
direction towards metallic contact in the p-region. Since the n-region is very narrow, most 
of the photons are absorbed in the depletion region. Each generated electron increases 
charge in the n-region by —e; similarly each hole makes the p-region more positive by +e. 
Thus potential difference is created between metallic electrodes. 

A semiconductor can only efficiently convert photons into current with energies equal to 
the bandgap. Photons with energies smaller than the bandgap are not absorbed, and photons 
with higher energies reduce their energies to the bandgap energy by thermalization of the 
photogenerated carriers; a process which involves losses. 

The basic relation which provides the link between photons’ energy and their wavelength 
is 


A.[pm] = 


1.24 

E\eV] 


(16.1) 


If both terminals of solar cell are shorted, the excess electrons on the n-side will start 
flowing through an external wire contributing to electrical current known as photocurrent, 
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incident light 



Solar cell perspective. 



The principle of photocurrent generation in a solar cell in short circuit. 


Iph, see Fig. 16.6. The short circuit conventional current I sc flows opposite to photocurrent. 
If hight is the light intensity, one can write 

At = Iph = K • Ilight 

where K is some (device dependent) constant. 
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Solar cell driving an external load resistance R. 
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V 




Finding an operating point of a solar cell. 

When the illuminated solar cell is loaded with resistance R, the conventional current / is 
I = I d — I ph , see Fig. 16.6, where T d is a forward diode current: 



(16.2) 


and where Iq is a constant, V is the voltage across diode and r/ is known as diode fidelity 
factor. It is equal to 1 for diffusion controlled and 2 for space charge layer recombination 
controlled characteristics. 

The I-V characteristic of solar cell is shown in Fig. 16.7. It is obtained from a dark 
diode by shifting its I-V curve downward by I p h . The analytical expression for conventional 
current is 



(16.3) 


With the load resistance R connected to the solar cell, see Fig. 16.6, the current-voltage 
relation is 


V 

R 


(16.4) 


Operating point of solar cell with load resistance R is obtained by solving Eqs. (16.3) and 

(16.4). The graphical solution is illustrated in Fig. 16.8., where we have plotted Eqs. (16.3) 
and (16.4). Crossing of both lines determines operating point (V\, I \) of a solar cell. 

The point at which a characteristic intersects the vertical current axis is known as a short 
circuit condition and the corresponding current as short-circuit current I sc . Similarly, the 
point at which I-V characteristic intersects the horizontal voltage axis is known as the open 
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Finding an operating point of a solar cell. F^ is an open-circuit voltage. 


I 



-O 


V 



Equivalent circuit of a solar cell. 


circuit condition. It defines the open-circuit voltage F^, which is the maximum voltage 
which can be drawn from the solar cell. It corresponds to zero current. 

Power delivered to the load R is Pu„d = I\ ■ V\, which is the area of the rectangle, see 
Fig. 16.7. The goal of the solar cell design is to maximize that power. 


16.3 Equivalent circuit of solar cells 


Over the years several methods of modelling solar cells have been introduced. The simplest 
approach is based on equivalent circuit models. Some recently published works with an 
emphasis on using MATLAB are [7], [8], [9], 


16.3.1 Basic model 


The equivalent circuit of solar cell which includes parasitic effects is shown in Fig. 16.9. 
It consists of current source which generates photoelectric current I ph , ideal diode, shunt 
resistor and series resistor. Based on this model, the goal is to obtain current-voltage relation. 
From Kirchoff rule one obtains (from now on we reverse direction of flow of conventional 
current, which is a common practice) 


7 — Iph Id 


(16.5) 


where / is the cell total current and Id is the diode current. Combining the above and 
relation (16.3), one finds 



(16.6) 
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This equation describes an ideal situation without parasitic effects. In a real solar cell 
there exists leakage current flowing through shunt resistance R s r . Potential drop across the 
device is represented by series resistance R s . The inclusion of shunt resistance modifies 
solar current / as 



(16.7) 


The impact of series resistance is included as 

V d = V + I-R s 

With the parasitic effects included, the solar current I takes the form 


(16.8) 



(16.9) 


The photocurrent I ph depends on the solar insolation and cell’s temperature and it is given 
by [10], [11] 


Iph = |4 +Ki (T c - T ref )\ S m 


(16.10) 


where I sc is the cell’s short-circuit current at a 25° C and 1 kW m 2 , Kj is the cell’s short- 
circuit current temperature coefficient, T„, f is the cell’s reference temperature and S m is the 
solar insolation in kW m 2 . 

The expression for Iq is [10], [11] 



(16.11) 


where Irs is cell’s reverse saturation at a reference temperature and a solar radiation, E gap 
is the bandgap energy of the semiconductor used in the cell and A is an ideal factor which 
depends on PV technology. Some typical values are: A = 1.2 for Si-mono, A — 1.3 for 
Si-poly and A = 1.5 for CdTe (after [10]). 


16.3.2 Other models 


There exist other models of solar cells which are based on a basic model. Here, we will 
only mention two such models: the double exponential model and ideal PV cell model. 

Double exponential model [12] is shown in Fig. 16.10. This model contains a light 
generated current source, two diodes D\ and Di and a series and parallel resistances. The 
model is derived from the physics of the p-n junction and can describe cells constructed 
from polycrystalline silicon. 

Simple models. The shunt resistance R s h is inversely related to shunt leakage current to 
the ground. PV efficiency shows little sensitivity to the variation of R s h and one can assume 
that R sh = oo. The resulting model is similar to the basic one, shown in Fig. 16.9, without 
R s h. The current-voltage expression for this model is 



(16.12) 
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Current-voltage characteristics of an ideal solar cell based on Si for several values of series resistance. 


An ideal PV cell model assumes no series loss and no leakage to ground; i.e. R s = 0 and 
R sh = oo. The equivalent circuit consists of only current source and one diode. Current is 
expressed as 

We conducted analysis using typical parameters, which are summarized in Table 16.1. 
Current-voltage characteristics for several values of series resistance are shown in Fig. 16.11 
and in Fig. 16.12 we presented current-voltage characteristics at three different values 
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Current-voltage characteristics of an ideal solar cell based on Si for several values of temperature. 


of temperatures. MATLAB code used to generate those characteristics is shown in the 
Appendix, Listings 16A.1, 16A.2 and 16A.3. These are typical results as reported in the 
literature. 


16.4 Multijunctions 


As determined by Shockley and Queisser [13], the maximum theoretical limit for a single 
junction solar cell is about 31%. In order to increase theoretical efficiency one needs to 
increase number of p-n junctions in the cell. Henry [14] found that maximum theoretical 
efficiency at a concentration of 1 sun is 31%. At a concentration of 1000 suns with the cell 
at 300 K, the maximum efficiencies are 37%, 50%, 56% and 72% for cells with 1, 2, 3 and 
36 p-n junctions (with different, properly chosen energy gaps), respectively. 

Increase in efficiencies can be obtained by splitting solar spectrum into several parts 
and using different materials for conversion in various parts. This principle is illustrated in 
Fig. 16.13 foratriple solar cell built from Gao.49f«o.5if’(l-9eV), Gao.99f«o.oi^(1.4eV)and 
Ge(0.7 eV) (adopted from Dimroth [15]). Germanium is used mainly due to its robustness 
and its low cost of production. Those different semiconductor materials are grown on 
one substrate. Tunnel junctions are used to form an electrical series connection of the 
subcells. 

Suitable materials must be chosen for each cell so that photons of appropriate wavelengths 
are absorbed. The bandgap of each material is determined using Eq. (16.1). Energy bandgap 
E g must decrease from top cell to bottom cell in order not to absorb all photons by the top 
cell. Also, all layers of semiconductors must be lattice matched, so that there is no built-in 
strain. 

The anti-reflective (AR) coating typically consists of several layers. Design of AR layers 
can be performed using methods discussed in Chapter 3. 
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wavelength 
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Middle cell: GaAs 


AR coating | 


Tunnel junction 
Bottom cell: Ge 


Energy distribution (kW/ m 2 pm) 



MiTTil.'REl Triple-junction solar cell. (From F. Dimroth, High-efficiency solar cells from lll-V compound semiconductors, Phys. Stat. 
Sol. (c) 3, 373-9 (2006). Copyright Wiley-VCH Verlag GmbH and Co. KGaA. Reproduced with permission.) 


The presence of tunnel junctions is to provide low electrical resistance between subcells. 
Modelling of those layers specifically for the purpose of MJ solar cells has been described 
recently by Baudrit and Algora [16]. 

16.4.1 Quantum dots in multijunctions 


The triple-junction subcells shown are connected in series and their voltages add up, but 
the current flowing through the structure is determined by the smallest current produced by 
subcells. Total power generated by the structure is greater than that of the single junction 
cell. 

The thickness of each subcell is determined by its absorption and the light power available 
in the spectral range where the subcell operates. Referring to Fig. 16.13, the bottom cell 
(formed by Ge, including substrate) is much thicker than two upper subcells. Due to this 
and also because it covers much larger spectral range, the Ge subcell produces much larger 
current compared to two upper subcells. 

One of the possible solutions of reducing current of the Ge subcell and at the same 
time increasing current of the GaAs subcell, is to introduce InAs-based quantum dots into 
the GaAs subcell [17]. These quantum dots are designed to have an effective bandgap 
slightly smaller than that of GaAs. Therefore they can ‘steal’ current from bottom Ga 
subcell roughly by about 15%. Increasing also the thickness of top GalnP subcell to match 
the increased current of GaAs results in an overall current increase by about 15%. With 
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^*■■0 ill E a 

— 0 - E v 

Illustration of a concept of intermediate-band solar cell. 

the minimal change in voltage over GaAs subcell, this concept resulted in an increase of 
triple-junction conversion efficiency by about 13%. 

16.4.2 Intermediate band solar cells (IBSC) 


The concept of intermediate band solar cells (IBSC) was first described by Luque and Marti 
[18] and by Wolf [19]. The main idea was to create an additional band between conduction 
and valence bands, see Fig. 16.14. The additional intermediate band can be created using 
several approaches, like, for example, minibands. 

The standard cell is only able to absorb photons with energy equal to or greater than 
E gap = E c — E v . Within this new scheme it is possible to have additional absorption via 
the IB. 

The electronic states in the IB should be accessible via direct transitions. Therefore 
photon of energy E IV can transfer an electron from the valence band into IB whereas 
another photon with energy Eci will be able to pump electrons from IB to conduction band. 
Thus the performance of solar cell is increased. 

Solar cells based on this design are attractive because of their predicted photon conver¬ 
sion efficiency of up to about 60% [20], The authors of Ref. [20] show that the ordered 
three-dimensional arrays of quantum dots, i.e., quantum dot supracrystals, can be used to 
implement the intermediate-band solar cell with the efficiency significantly exceeding the 
Shockley-Queisser limit for a single junction cell. The increase is due to the utilization of 
photogenerated hot carriers which can produce higher voltages and higher photocurrents. 

Modelling of such structures has been conducted using the CFD Research Corporation 
3D device simulator, NanoTCAD [21], They performed accurate simulations of quantum 
dot solar cell performance and degradation due to effects of space radiation. 

A recent summary of some aspects of intermediate-band solar cells was published in 
Nature Photonics [22], 


16.4.3 Role of simulations 


Numerical simulations and modelling of solar cells play an important role. One particular 
example has been mentioned in the preceding section. Optimization of modem structures, 
especially for multijunction or quantum dots by experimental trial-and-error methods, is 
definitely too costly. This opens the room for simulations. 
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The optical and electrical interactions between the numerous layers in a multi-junction 
solar cell are very complex. Over the years several groups have developed simulation models 
of various levels of sophistication. The simplest approach based on equivalent circuits 
model was summarized in an earlier section. There were also many successful attempts to 
develop microscopic models of various complexity. Full discussion of microscopic models, 
including those based on drift-diffusion approach, is beyond the scope of this book. 

Also, a few commercial products are found on the market: Sentaurus from Synopsys, 
Atlas from Silvaco or Solar Cell utility from RSoft. Some recent examples of using com¬ 
mercial software to simulate and design solar cells are described in [23], [24] and [25], 
The authors of Ref. [25] describe details of using Sentaurus to simulate quantum well solar 
cells. 

More sophisticated approaches to designing of quantum well solar cells using quantum 
transport are also under development [26], [27]. Such approaches combine elements from 
semiconductor optics and quantum transport in nanostructures. A very fundamental 
approach based on non-equilibrium Green’s functions (NEGF) is discussed extensively 
in [26]. The developed approach treats absorption, transport and relaxation on equal foot¬ 
ing and within a framework based on non-equilibrium quantum statistical mechanics. 

We finish this discussion by mentioning the role played by plasmonics in improving 
properties of photovoltaic devices, see a recent summary [28], Plasmonics describe guiding 
and localizing light at the nanoscale smaller than the wavelength of light in free space. 
Using plasmonics allows for new designs of solar cells in which light is fully absorbed in a 
single quantum well. Proposed solutions can result in improved absorption in photovoltaic 
devices. 


Appendix 16A: MATLAB listings 


In Table 16.2 we provide a list of MATLAB files created for Chapter 16 and a short 
description of each function. 

Listing 16A.1 Program solar_Si.m. MATLAB program which plots I-V characteristics of 
an ideal solar cell for several values of series resistance based on equivalent circuit model. 

’/ 0 File name: solar_Si.m 

°/. Simulates ideal solar cell based on Si 


Table 16.2 List of MATLAB functions for Chapter . 1 

Listing 

Function nai 

ne Description 

16A.1 

16A.2 

16A.3 

solar jSi.m 

current _V.m 

solar JSi_T.n 

Generates I-V characteristics for several values of series resistance 
Function used by solar_Si.m and solar_Si_T.m 
i Generates I-V characteristics for several values of temperature 
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clear all 
eta = 1.5; 

I_ph =0.02; % illumination current, in mA 

T = 330; 

V_init = 0; 

V_final =1.0; 

V_step = 0.01; 

V=0; 
hold on 

for R_s = [0.01 20 50] 7. series resistance, in ohms 

curr_total = fzero(@(curr) current_V(curr,V,T,eta,I_ph,R_s),0.2); 
for V = V_init:V_step:V_final 

curr = fzero(S(curr) current_V(curr,V,T,eta,I_ph,R_s),0.2); 
curr_total = [curr_total, curr]; 

end 

V_plot = V_init:V_step:V_final; % creation voltage values for plot 
V_plot = [0, V_plot]; 

plot(V_plot, curr_total,’LineWidth’,1.5) 

set(gca,’FontSize’,14); 7. size of tick marks on both axes 

axis([0 V_final*0.8 0 I_ph*1.2]); 

ylabel(’Current (mA)’,’Fontsize’,14) 

xlabelC’Voltage (V)’,’Fontsize’,14) 

text(0.65, 0.01, ’R_s = 0 \0mega’,’Fontsize’,14) 

text(0.45, 0.01, ’R_s = 20 \0mega’,’Fontsize’,14) 

text(0.2, 0.01, ’R_s = 50 \0mega’,’Fontsize’,14) 

end 

pause 

close all 


Listing 16A.2 Function current Vm. MATLAB function which creates an expression for 
current in the equivalent circuit model. 


function fun = current_V(I,V,T,eta,I_ph,R_s) 

7, Expression for current used in solar cell model based on equivalent 
7. circuit 


q = 1.6d-19; 
k_B = 1.38d-23; 
R_sh = 415; 

I_0 = 3d-9; 


7. charge of electron 

7. Boltzmann constant 

7, shunt resistance in ohms 

7, reverse saturation current of the diode 


VV = V+I*R_s; 

fun = I_ph - I - I_0*(exp(q*VV/(eta*k_B*T))-l) - VV/R_sh; 


Listing 16A.3 Function solar_Si_T.m. MATLAB program which plots current-voltage 
characteristics of an ideal solar cell for three temperatures using the equivalent circuit 
model. 
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7. File name: solar_Si_T.m 

’/o Simulates effect of temperature for ideal solar cell based on Si 
clear all 
eta = 1.5; 

I_ph = 0.02; 7» illumination current, in mA 

R_s = 10; 7, series resistance, in ohms 

V_init = 0; 

V_final =1.0; 

V_step = 0.01; 

V=0; 
hold on 

for T = [300 325 350] 7, temperature 

curr_total = fzero(Q(curr) current_V(curr,V,T,eta,I_ph,R_s),0.2); 

for V = V_init:V_step:V_final 

curr = fzero(@(curr) current_V(curr,V,T,eta,I_ph,R_s),0.2); 
curr_total = [curr_total, curr]; 

end 

V_plot = V_init:V_step:V_final; 7. creation voltage values for plot 
V_plot = [0, V_plot]; 

plot(V_plot, curr_total,’LineWidth’,1.5) 

setCgca,’FontSize’,14); % size of tick marks on both axes 

axis([0 V_final*0.8 0 I_ph*1.2]); 

ylabel(’Current (mA)’,’Fontsize’,14) 

xlabelC’Voltage (V)’,’Fontsize’,14) 

text(0.65, 0.005, ’300K’,’Fontsize’,14) 

text(0.38, 0.01, ’350K’,’Fontsize’,14) 

end 

pause 

close all 
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Metamaterials 




In this chapter we review the basic concept of metamaterials as those possessing simulta¬ 
neously negative permittivity and permeability over the same frequency range. Theoretical 
principles and basic experimental results are reviewed. Possible applications including 
cloaking, slow light and optical black holes are described. 


17.1 Introduction 


Metamaterials are artificially created structures with predefined electromagnetic properties. 
They are fabricated from identical elements (atoms) which form one-, two- or three- 
dimensional structures. They resemble natural solid state structures. Metamaterials typically 
form a periodic arrangement of artificial elements designed to achieve new properties 
usually not seen in Nature [ 1 ]. In a sense, they are composed of elements in the same way 
as matter consists of atoms. 

Metamaterials are characterized and defined by their response to electromagnetic wave. 
Optical properties of such materials are determined by an effective permittivity E e g and 
permeability /i e ff valid on a length scale greater than the size of the constituent units. 
In order to introduce such a description, one requires that the size of artificial inclusions 
characterized by d be much smaller than wavelength X, i.e. d <$ (A.. 

The name meta originates from Greek, /leza and means ‘beyond’. Main characteristics 
of metamaterials (MM) are: 

• man-made, 

• have properties not found in Nature, 

• have rationally designed properties, 

• are constructed by placing inclusions at desired locations. 

With modern fabrication techniques it is possible to create structures which are much 
smaller than the wavelength of visible light. An example of the unusual properties that such 
structures could have is the negative refractive index. Although the problem has a long 
history (for example, Mandelstam [2] in 1945 discussed negative refraction and negative 
group velocity), it was only in 1967 when Victor Veselago [3] indicated the possibility 
of materials having simultaneously negative /x and e. He demonstrated the technological 
potential that materials with a negative refractive index could have for applications in 
imaging. It recently made its way to optics, mostly due to rapid progress in nanofabrication. 
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Schematics of the elementary cells and the wavelength of external electromagnetic wave in two extreme cases. 


The most well-known property of metamaterials is the negative index of refraction 
(NIM - negative index materials). Another popular name used is left-handed material. This 
terminology will soon be explained. 

The situation is illustrated in Fig. 17.1. For a longer wavelength, one cannot sense 
the properties of individual constituent atoms, shown on the left, whereas the shorter 
wavelength, on the order of the distance between ‘atoms’, can effectively be used to 
determine some of the atom’s properties (like their locations). In metamaterials we deal 
with the situation on the left. 

Metamaterials can have controlled magnetic and electric responses over a broad range of 
frequencies. Those responses depend on the properties of individual elements (atoms). In 
the long-wavelength limit, where a <?C a, where a is the characteristic dimension and 7. is the 
wavelength of electromagnetic wave, one should perform some sort of averaging procedure 
to determine effective parameters of MM. Details of those procedures are discussed in the 
literature [4], In the end, it is possible to achieve the condition where s e g- < 0 and ii e ff < 0. 

A diagram which illustrates the classification of metamaterials is shown in Fig. 17.2. In 
general negative index materials do not exist in Nature; the rare exception is bismuth, which 
when placed in a waveguide shows a negative refractive index at a wavelength of a = 60 qm 
[5], There are no known naturally occurring NIMs in the optical range. However, artificially 
designed materials (metamaterials) can act as NIM. One should, however, notice that the 
occurrence of NIM needs both negative s e ff < 0 and fi e ff < 0 over the same frequency 
range. 

Metamaterials can open new avenues to achieve unprecedented physical properties and 
functionality unattainable with naturally existing materials. Optical NIMs promise to create 
entirely new prospects for controlling and manipulating light, optical sensing and nanoscale 
imaging and photolithography. 

17.1.1 Short history of MM 


The earliest discussion of the concept of negative refraction goes probably to Schuster 
[6], In the book entitled An Introduction to the Theory of Optics he noted that negative 
dispersion of the refractive index with respect to wavelength, i.e. dn/dX < 0, can produce 
negative refraction of light entering such a medium from a vacuum. Some other early works 
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on negative refraction include Lamb [7] published in 1904 and Pocklington [8] published 
in 1905. Historical aspects were summarized by Simovski and Tretyakov [9] and by Moroz 
[ 10 ]. 

In recent years the field of metamaterials has received remarkable attention with the 
number of published papers growing exponentially. This is due to unusual properties of 
such systems (see [4] for a recent review) and also important practical applications like 
perfect lenses [11], invisibility cloaking [12], [13], slow light [14] and enhanced optical 
nonlinearities [15]. Parallel to theoretical developments, there has been noted spectacular 
experimental progress in the field [16]. 

The subject of metamaterials is attracting enormous attention from researchers. We 
searched ISI Web of Knowledge, and as of October, 2011 there were 4545 published papers 
on the subject of ‘metamaterials’. The plot of the number of published papers is shown in 
Fig. 17.3. As can be seen, the number of published papers grows exponentially with some 
saturation observed recently. 

The modem research progress started in 1999 with the pioneering work by Pendry 
and collaborators [17] who discussed artificial electromagnetic structures, such as split¬ 
ring resonators (SRR) (in general, two concentric split rings), for which they predicted the 
existence of negative magnetic permeability. In those structures the incident electromagnetic 
wave excites circulating currents in the loops. These currents in turn create oscillating 
magnetic dipole moments. With proper design, those currents are resonantly enhanced, 
leading to a negative magnetic permeability. 

An SRR meta-atom can, in the long-wavelength (quasi-static) regime, be modelled as 
a resistor/inductor/capacitor (RLC) resonant circuit [18]. Stacks of such structures exhibit 
negative permeability. 

The creation of a negative electric permittivity is relatively easy. For instance, it is well- 
known that metals at optical frequencies exhibit a plasmon-like permittivity that can assume 
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Year 


Number of papers published on the subject of'metamaterials'. 


negative values. At smaller (e.g. microwave) frequencies, it was shown by Pendry et al. 
in 1996 [19] that an array of long wires acts as a diluted Drude metal when the electric 
field is oriented along the wire axis. Such a system allows for the existence of a negative 
permittivity below the effective plasma frequency. A combination of both magnetic and 
electric elements in a suitable design leads to negative index of refraction over a specified 
frequency band. 

A report on the first negative-index material (NIM) designed and fabricated along the 
above principles was published in 2000 by Smith et al. [20], It operated at microwave 
frequencies. The individual SRR had its resonance frequency at 4.845 GHz. The diameter 
of the wires was about 0.8 mm, which resulted in a plasma frequency of co p = 13 x 10 9 s _1 . 
There, the authors performed transmission measurements on a NIM structure having a 
range of frequencies over which the refractive index was negative for one direction of 
propagation. In a subsequent paper, Shelby et al. [21] (again in a transmission experiment) 
demonstrated a negative refraction of microwaves incident on the interface between air and 
NIM. 

In 2003, a group from Boeing Phantom Works reported [22] a very accurate measure¬ 
ments of a Snell’s law using a NIM wedge at frequencies from 12.6 to 13.2 GHz. 

Following those pioneering demonstrations, in the next few years the operation fre¬ 
quency has been increased by more than four orders of magnitude. In particular, Yen 
et al. [23] demonstrated a NIM response at 1 THz (A. = 300 pm) by making use of 
SRRs placed on a double-sided polished quartz substrate. Further design and fabrica¬ 
tion improvements have led to the possibility of creating a three-dimensional double¬ 
negative medium with subwavelength meta-atoms in the optical regime [24], [25], Indeed, 
in 2008, a three-dimensional photonic NIM at optical frequencies has been demonstrated in 
Ref. [26], 
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17.2 Veselago approach 

In this section we summarize an early approach as developed by Veselago [3]. 

17.2.1 Wave equation 

In order to proceed with a quantitative description, at this stage we will introduce Maxwell’s 
equations, which in the absence of free charges and currents are 

VxE = -/r^ and VxH=e^ (17.1) 

at at 

Take V x (rotation) operation on the first equation and then use second equation 

9 9 2 E 

VxVxE = - e -(VxH) = -^ 

Use general relation valid for arbitrary vector A 

V x V x A = V (V • A) - V 2 A = — V 2 A 

In the last step, we used V ■ E =0. One obtains the wave equation 

V 2 E = -ne d -^ = 0 (17.2) 

If we disregard losses and consider e and /x as real numbers, then one can observe that 
wave equation is unchanged when we simultaneously change signs of e and /x. 


17.2.2 Left-handed materials 


Start with the basic Maxwell’s equations (17.1). Assume time-harmonic fields 
E (x,y, z, t ) = E (x,y, z)e mt + c.c. 
and introduce wave vector k by assuming plane-wave dependence 
E, H ~ e ,k r 

Maxwell’s equations take the form 

k x E = -ixcoU (17.3) 

k x H = ea& (17.4) 

From Eqs. (17.3) and (17.4) and definition of cross product, one can immediately see 
that for e > 0 and /x > 0 vectors E, H and k form a right-handed triplet of vectors, and if 
e < 0 and /x < 0 they form a left-handed system, see Fig. 17.4. 
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E 


H 


H 



k E 



k 


(a) 


(b) 


(a) Right-hand orientation of vectors E, H, k for the case when s > 0,/r > 0. (b) Left-hand orientation of vectors 


E,H, k for the case when e < 0, /x < 0. The figure is taken, with permission of the Canadian Association of 
Physicists (CAP), from an article by Wartak et al. [27], 





Reflection and refraction in negative index material. The figure is taken, with permission of the Canadian Association 
of Physicists (CAP), from an article by Wartak et al. [21]. 


17.2.3 The refraction of a ray 


Consider propagation of a ray through the boundary between left-handed and right-handed 
media, in Fig. 17.5. Light crossing the interface at non-normal incidence undergoes 
refraction, that is a change in its direction of propagation. The angle of refraction depends 
on the absolute value of the refractive index of the medium. Here, 1 is the incident ray, 
2 is the reflected ray, 3 is the refracted ray assuming medium '2' is right-handed, and 4 
is the refracted ray when assuming that medium 'll is left-handed. In a metamaterial the 
refraction of light would be on the same side of the normal as the incident beam, see 
Fig. 17.5. The relation between angles is determined by Snell’s law. 


17.3 How to create metamaterial? 


17.3.1 Metamaterials with negative effective permittivity in the microwave regime 


At optical frequencies metals are characterized by an electric permittivity that varies with 
frequency according to Drude relation 



(17.5) 
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Schematic illustration of a periodic arrangement of infinitely long thin wires along z direction used in the creation of 
an effective plasma medium at microwave frequencies. The figure is taken, with permission of the Canadian 
Association of Physicists (CAP), from an article by Wartak et al. [27], 


Here co 2 = ^ is the plasma frequency, i.e. the frequency with which the plasma consisting 
of free electrons oscillates in the presence of an external electric field. Typical values for co p 
are in the ultraviolet regime. The other symbols are: TV is the electron density; e is charge of 
an electron and m its mass. The parameter y describes damping and its value, for example 
for copper is y « 4 x 10 13 rad s _1 . 

In the limit when y = 0, from Eq. (17.5) it follows that e < 0 for co < co p ; i.e. the 
medium is characterized by a negative permittivity. Considering typical values of co p , the 
resulting range of negative values of e is in the ultraviolet regime. Unfortunately, in this 
frequency range co y, and as a result losses dominate the behaviour of e. Thus, using 
metals to achieve negative e over this frequency range will be impractical (high losses) and 
the propagation of light will be mainly evanescent. 

To achieve negative e at microwave frequencies, Pendry et al. [28] proposed to use 
periodic structure consisting of long thin metallic wires of radius r arranged on a horizontal 
plane (xy), see Fig. 17.6. The unit cell of this periodic structure is a square whose sides 
have length equal to a. 

When electric field E = Eoe~ l</Ot ~^z is incident on this structure, it forces free electrons 
to move inside the wires in the direction of the incident field. Effective electron density N e ff 
of such structures which participate in plasma oscillations is N e jy = TV^r (with N being 
the electron density inside each wire, r the radius of a wire and a distance between wires), 
which is significantly smaller compared to N, thus reducing the effective plasma frequency. 
For example, for a wire with radius r — 1 qm and wire spacing a = 5 mm, one finds that 
N e ff ~ 1.3 x 10 -7 N; i.e. the effective electron density of the new medium is reduced by 
seven orders of magnitude compared to that of the free electron gas inside an isolated wire. 

Additionally, in such engineered structures the effective mass m e jf of the electrons is 
significantly larger compared to that of a free electron. To determine effective mass of an 
electron in this wired medium, we use the classical equation of motion of a moving electron: 

d(m\)/dt = e[E + vxBl-> — [my + eX] = —eV(® — v • A) (17.6) 
dt 

where e is the charge of an electron and v the velocity of an electron. One also has B = V x A 
and E = — Vyj — dA/dt. We divide the xy plane into circles of radius R c , centred at each 
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wire and having area equal to that of the square unit cell; i.e. R c = a/y/u. Furthermore, 
we assume that the wires are sufficiently apart from each other so that the magnetic field 
inside each circle arises only from the current / that flows perpendicularly to the centre of 
the circle, and that the field at the circumference of each circle vanishes; i.e. H(R C ) =0. 
Magnetic field intensity at a distance R from each wire is given by 



(17.7) 


Magnetic field H associated with a vector potential A according to the relation H = 
/i, ( y 1 V x A gives the following expression for the vector potential A: 



(17.8) 


where z is a unit vector along the z-direction. It has been assumed that A (R > R c ) = 0. For 
distances very close to the wires, i.e. for R —> r « R c , Eq. (17.8) gives 


A ——/«(a/r)z 


(17.9) 


Since both v and A point along z, the right hand side of the second part of Eq. (17.6) will 
vanish (for a given R); i.e. the so-called ‘conjugate momentum’ (m\ + eA) of an electron 
will be conserved along the z-direction. As a result, an electron will be moving in the above 
engineered medium with an effective mass, m e jf = eA(r)/v. Realizing that the current I can 
be simply re-expressed as/ = — (nr 1 ) (Nev), we finally obtain using Eq. (17.9) the effective 
mass of a moving electron inside our effective medium m e ff = 0.5 x HoNe 2 r 2 ln(a/r). 
Thus, for copper wires of radius r — 1 pm, being separated by a = 5 mm, we obtain: m e ff <=» 
1.3 x 10 4 m; i.e. the effective mass of an electron in our engineered medium is increased by 
more than four orders of magnitude. This, combined with the fact that the effective electron 
density is reduced by approximately seven orders of magnitude, leads to an effective plasma 
frequency that is in the microwave regime: 


^- = 5.1 x 10 10 [rads -1 ] 2 


2 


m eff e 0 


v p = (Dp/ln = 8.2 GHz 


(17.10) 


It should be noted that, based on Eq. (17.10), the calculated wavelength, k p = c/v p , which 
corresponds to our medium’s effective plasma frequency, turns out to be considerably larger 
compared to the periodicity of the structure (k p ~ 7 a), justifying the description of the 
periodic structure as an effective medium. Therefore we are able to construct an engineered 
medium that can exhibit a negative electric permittivity in the microwave regime with 
reasonably low losses. 


17.3.2 Magnetic properties: split-ring resonators 


In the previous section, we examined how to construct an artificial structure possessing 
negative effective permittivity e. Here, we will show how to create negative effective 
permeability /x in the microwave regime using the so-called split-ring resonators (SRR). 




392 


Metamaterials 



Perspective view of split-ring resonator. 
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w 
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W 7 



zf=^ 

W 7 

W 7 



(c) 



| Arrangement of SRR in space. Reprinted with permission from H. Chen et al.,J. Appl. Phys., 100 , 024915 (2006). 
Copyright 2006, American Institute of Physics. 


The perspective view of SRR is shown in Fig. 17.7. The structure consists of two cut 
cylinders. The model used to represent SRR is illustrated in Fig. 17.8. Here we will describe 
description of split-ring resonators (SRR) based on equivalent circuit model approach [18]. 
The structure under consideration is shown in Fig. 17.8. 
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We will establish the equivalent circuit model of this structure and show that it can 
produce negative effective permittivity //. The unit cell shown above can be modelled using 
equivalent circuit model shown in Fig. 17.9. 

Apply a time-varying external magnetic field Hq in the y-direction. It will induce a 
current / flowing in each SRR unit. From Faraday’s law, the voltage V due to the external 
field H 0 is 

V = icononr^Ho (17.11) 


where r is the radius of the ring. Along the y-direction, SRR loops form a column (stack) 
which behaves like a solenoid. In each loop there is a current I flowing. Neglect the fringing 
effects, i.e. spreading of the magnetic field lines. The magnetic flux in such a column is 
thus 

<J> = jrrVoy (17.12) 

where / is the separation between loops. The inductance L appearing in the circuit model is 


L = 


r 2 


(17.13) 


The inductance L is defined per ring of an infinite column of rings. 

Let L' be the total inductance of the SRR emits in the other column (excluding column 
represented by inductance L). Coupling between L and H is represented by the mutual 
inductance M. For very long columns in y-direction, the mutual inductance M is 


., «>£ ,. nr 2 4>d r xr 2 

M — — — lim —- — = lim —- (n — 1 )L 

= 7tr ^L = F-L ( 17 . 14 ) 

a 2 

Here <t> 7 is the flux of the depolarization field located in the interior of L, <t> = L ■ I is the 
depolarization field generated by one column of the rings, </v = (n — l)d> = (n — \ ) L ■ I 
is the flux of the total depolarization field, and F = 71 £ is the fractional volume of the 
periodic unit cell in the xz plane occupied by the interior of the SRR. 
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Applying KirchofF’s voltage drop law for a loop in the equivalent circuit shown in 
Fig. 17.9 gives 

V = R ■ I + —— + {—icoL) I — (-icoM)I (17.15) 

—icoC 

where C = \C g is the total capacitance in the loop. 

From the previous equations 

V = i(on.Qitr 1 HQ = icoLI H 0 (17.16) 


From Eq. (17.15) one has 



I 


Solving for current 

J _ V _ io) LI H 0 

R — -L — icoL + icoM R— — icoL + iooF ■ L 
LH 0 _ -LHp 

~ m + ^c~^ + F ~ + 

Magnetic dipole moment per rmit volume of the material is 


with the current / inferred from the above equation to be 


(17.17) 


/ = 


H 0 1 

(1 - F) - 1 / (co 2 LC) + iR/(coL ) 


(17.18) 


As a result, the (relative) effective magnetic permeability associated with this medium will 
be (in the direction, x, that the incident magnetic field is polarized) 


B/lM) _ _ F _ 

B/ii o -M d ~ 1 - 1 / ( co 2 LC) + iR/ ( eoL ) 


(17.19) 


From Eq. (17.19) we can see that fi assumes negative values in the range: 1 /a/ZC < 
w p < 1 f«jLC(\ — F), where w m o = 1 /*/LC is the resonance frequency of the Lorentzian 
variation of the medium’s magnetic permeability, and co mp = 1 /sjLC( \ — F) is the cor¬ 
responding plasma frequency (where Re{/i} = 0). Crucially, we note that the resonant 
wavelength (A. m0 ) of the structure depends entirely on the rings’ effective inductance (L) 
and capacitance (C), and can therefore be made considerably larger than the periodicity (a) 
of the structure, thereby fully justifying its description as an effective medium. 

Plot of real and imaginary parts of effective magnetic permeability is shown in Fig. 17.10 
and the MATLAB code is in Appendix, Listing 17A.1. 

The combination of long wires and SRR in a unit cell is shown in Fig. 17.11. The effect 
of both results in a negative effective permittivity and permeability over the same frequency 
band. 

We finish this section by showing in Fig. 17.12 schematic plots of effective permittivity 
(s e ff) and permeability 
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Frequency (GHz) 

Real and imaginary parts of magnetic permeability. 



Elementary cell of metamaterial formed by SRR and thin wire. The figure is taken, with permission of the Canadian 
Association of Physicists (CAP), from an article by Wartak et al. [27], 



(a) (b) 


(a) Permittivity of wire medium demonstrating plasma-like frequency dependent permittivity and (b) Frequency 
response of effective permeability. The figure is taken, with permission of the Canadian Association of Physicists (CAP), 
from an article by Wartak etal. [27], 

17.4 Some applications of metamaterials 


Materials with such unusual properties allow for unusual applications. We do not attempt 
to review all of them; we just concentrate on a few interesting possibilities. We start with 
the possibility of creating a so-called ‘perfect lens’. 
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f Double focussing of a source with a planar double negative (left-handed) metamaterials slab surrounded by regular 
(right-handed) dielectrics. 


17.4.1 Perfect lenses 


Until recently, it was thought that the manipulation of light is limited by the fundamental 
law of diffraction to a relatively long wavelengths (say around 0.5A). The sub-wavelength 
details are carried by evanescent harmonics which decay exponentially and are also subject 
to noise. A conventional lens only collects the propagating waves. The evanescent waves 
are lost due to their decay. 

A planar slab consisting of NIM with sufficient thickness can act as a lens, known 
as Veselago lens. Such a slab lens with refractive index n = — 1 placed in vacuum (see 
Fig. 17.13) can resolve details of an object with subwavelength precision [1 1], Detailed 
mathematical analysis is discussed, for example, by Ramakrishna and Grzegorczyk [4] and 
Cai and Shalaev [29], 

To understand the problem, consider a slab of thickness d and refractive index of, e.g. 
n = — 1, surrounded by air. ft will bring all rays emanating from a source to a double focus: 
first, at a point inside the NIM slab, at a distance s = l < d, where / is the distance of the 
source from the slab, and second at a point outside the slab, at a distance d — l. Hence, such 
a slab acts like a lens, and is able to bring the rays radiated by a source to a focus outside 
the slab, without reflections occurring at the media interfaces because the n — — 1 slab is 
impedance-matched to free space. 

Evanescent waves are associated with the high spatial frequencies of the electromagnetic 
waves created by source. They carry fine (subwavelength) features of the source. Therefore, 
a NIM slab can, in principle, enable us to obtain the image of an object with ‘perfect’ 
resolution, containing all the subwavelength features of an object and overcoming the usual 
diffraction limitations that characterize conventional lenses. 

17.4.2 Stopped light in metamaterials 


For decades scientists maintained that optical data cannot be stored statically and must be 
processed and switched on the fly. The reason for this conclusion was that stopping and 
storing an optical signal by dramatically reducing the speed of light itself was thought to 
be unfeasible. 

At present, some of the most successful slow-light designs based on photonic-crystals 
(PhCs) [30] or coupled-resonator optical waveguides (CROWs) [31], can indeed slow 
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| Classical G-H shift in regular dielectrics. The figure is taken, with permission of the Canadian Association of Physicists 
(CAP), from an article by Wartak etal. [27], 



| G-H shift in NIM forming central layer. The figure is taken, with permission of the Canadian Association of Physicists 
(CAP), from an article by Wartak etal. [27], 


down light efficiently by a factor of only 40; otherwise, large group-velocity-dispersion 
and attenuation-dispersion occur; i.e. the guided light pulses broaden and the attainable 
bandwidth is severely restricted. 

Recently Tsakmakidis et al. [14] proposed a new method that can allow for a true 
stopping of light in NIM. The stopping of light in the proposed configuration is associated 
with a negative Goos-Hanchen (G-H) phase shift, a lateral displacement of light ray when 
it is totally reflected at the interface of two different dielectric media. Classical G-H shift 
between dielectrics having positive refractive indices is illustrated in Fig. 17.14. 

In a structure where the central layer is formed by NIM, the G-H shift is reversed as it is 
illustrated in Fig. 17.15. 

To more precisely understand the manner in which light is decelerated in this structure, 
let us imagine a ray of light propagating in a zig-zag fashion along a waveguide with a 
negative-index (‘left-handed’) core. The ray experiences negative Goos-Hanchen lateral 
displacements each time it strikes the interfaces of the core with the positive-index (‘right- 
handed’) claddings, see Fig. 17.15 . Accordingly, the cross points of the incident and reflected 
rays will sit inside the left-handed core and the effective thickness of the guide will be 
smaller than its natural thickness. It is reasonable to expect that by gradually reducing 
the core physical thickness, the effective thickness of the guide will eventually vanish. 
Obviously, beyond that point the ray will not be able to propagate further down, and will 
effectively be trapped inside the negative index metamaterial (NIM) heterostructure, see 
Fig. 17.16. 

The authors of [14] proposed stopping a light pulse by varying the thickness of the 
waveguide core to the point where the cycle-averaged power flow in the core and the 
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| G-H shift in NIM at critical thickness. The figure is taken, with permission of the Canadian Association of Physicists 
(CAP), from an article by Wartak etal. [27], 


cladding become comparable. At the degeneracy point, where the magnitudes of these 
powers become equal, the total time-averaged power flow directed along the central axis 
of the core vanishes. At this point the group (or energy) velocity goes to zero and the 
path of the light ray forms a double light cone (‘optical clepsydra’) where the negative 
GH lateral shift experienced by the ray is equal to its positive lateral displacement as it 
travels across the core. Adiabatically reducing the thickness of the NIM core layer may 
thus, in principle, enable complete trapping of a range of light rays, each corresponding to 
a different frequency contained within a guided wavepacket. 

This ability of metamaterial-based heterostructures to dramatically decelerate or even 
completely stop [32] light under realistic experimental conditions, has recently led to a series 
of experimental works [33], [34] that have reported an observation of so-called ‘trapped 
rainbow’ light-stopping in metamaterial waveguides. 


17.4.3 Cloaking (invisibility) 


Unusual light-bending properties constitute a rather generic feature of metamaterials. Part 
of the excitement surrounding these materials is that they could be engineered to ‘cloak’ 
objects from electromagnetic radiation such as light: that is, make them seem invisible at 
specific frequencies. A metamaterial ‘invisibility’ cloak can be designed such that it does not 
reflect waves back nor scatter them in other directions. Several methods were proposed to 
make extended bodies invisible, such as those based on cancellation of scattering [35] or on 
coordinate transformations. The method suggested by Pendry et al. [36] (see also Leonhardt 
[12]) relies on controlling the paths of electromagnetic waves. It was applied to a spherical 
volume and uses coordinate transformation that expels the paths of electromagnetic waves 
(rays) from a spherical volume, squeezing them into a spherical shell around the volume 
that is to be cloaked, thereby making it invisible to incident radiation, see Fig. 17.17. The 
light rays smoothly avoid the cloaked object and flow around it like a fluid. They appear 
to have properties of the free space when observed externally. The rays make a detour 
around the hidden part of the device. On the left side they must arrive at the same time 
as if they were propagating through empty space. Since in the cloaking region they travel 
longer distances, their phase velocity must exceed c. This is in principle possible [37] (for 
a specific frequency). 
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-3 



| The rays go around the inner object and then go to the eye. An observer at the left of the cloak would see the point 


source. 


To introduce the relevant transformation, one starts with a mathematical point (placing it 
at the origin of a coordinate system), which is obviously invisibly small. To hide an extended 
object, say a sphere of radius R\, we transform the mathematical point at the origin into a 
sphere of radius R\ and the vicinity of the sphere into another sphere of radius R 2 > R\ ■ 
The transformation which does this is 



(17.20) 


Maxwell’s equations are form-invariant to coordinate transformations like the one above. 
Only the components of // and s are affected by the transformation. They become spatially 
varying and anisotropic tensors. Their forms were determined by Pendry et al. [36] and for 
Ri < r < R 2 are 



(17.21) 


Inside the sphere with radius R\ the permittivity and permeability can be arbitrary, whereas 
the space between the spheres is filled with a material having the permittivity and per¬ 
meability tensors determined by Eqs. (17.21). As a result, any object inside the sphere 
with radius r < Ri is concealed. These conclusions follow from exact manipulations of 
Maxwell’s equations and are not restricted to a ray approximation. 


17.4.4 Optical black holes 


By extending the concept behind the invisibility cloak, one can speculate on the possibility 
of creating an optical analogue of a black hole. In this concept, by proper design of a 
new class of metamaterials, one can expect to concentrate and trap light waves, similarly 
to what can happen in a ‘black hole’, see Fig. 17.18. In such a system, light will be 
permanently trapped. Such a black hole design has recently been proposed by Narimanov 
and Kildishev [38], Numerical simulations showed a highly efficient light absorption. The 
electromagnetic black hole was built recently [39], [40] and it operates at microwave 
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; Metamaterial black hole fabricated from properly crafted gradient-index material. 


frequencies. The structure is composed of 60 concentric layers, and each layer is a thin 
printed circuit board etched with a numbers of subwavelength unit structures on one side and 
coated with 0.018 mm thick copper on the other side. The permittivity changes radially in 
the shell of the microwave black hole, and hence the emit cells are identical in each layer but 
have different sizes in adjacent layers. The structure can efficiently absorb electromagnetic 
waves coming from all directions owing to the local control of electromagnetic fields. It is 
expected that such a microwave black hole could find important applications in solar-light 
harvesting, thermal emitting, and cross-talk reduction in microwave circuits and devices. 
These specially designed analogues of black holes could also be used for controlling, 
slowing and trapping electromagnetic waves [40], as well as for investigating some of the 
more exotic physics associated with celestial mechanics. 


17.5 Metamaterials with an active element 


This last section is intended to briefly discuss the problem of losses in metamaterials and 
how to compensate for them. 

As mentioned before, metamaterials show large losses which at present are orders of 
magnitude too large for practical applications and are considered as an important factor 
limiting practical applications of metamaterials. For example, detailed analytical studies 
show that losses limit the superresolution of a theoretical superlens [41], There was some 
controversy whether loss elimination can be feasible [42], but as shown by Webb and Thylen 
[43] it is possible to completely eliminate losses in metamaterials. 

Very recently, several computational [44], [45], [46] and experimental [47] works have 
demonstrated that optical losses can be fully overcome in realistic negative-reffactive- 
index metamaterials. The specific loss-free design considered in [44] and [47] consisted of 
two metallic films perforated with small rectangular holes (‘fishnets’), and with an active 
medium (laser dye) spacer between two films. Additionally, several reports [48], [49], 
[50] speculated about possible compensation for losses in metamaterials by introducing a 
gain element. For example, Wegener et al. [49] formulated a simple model where gain is 
represented by a fermionic two-level system which is coupled via a local-field to a single 
bosonic resonance representing the plasmonic resonance of the metamaterial. Also recently, 
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Fang et al. [50] described a model where the gain system is modelled by a generic four- 
level atomic system. They conducted numerical analysis using the FDTD technique. Gain 
material was introduced in the gap region of the split-ring resonators (SRR). The system 
had a magnetic resonance frequency at lOOTHz. Some other reports of the design and 
analysis of active metamaterials are by Yuan et al. [51] and Sivan et al. [32], 

17.6 Annotated bibliography 


We finish this short review by highlighting some recently published textbooks where further 

useful information about metamaterials can be found. We start with the most recent: 

• T. J. Cui, D. R. Smith, R. Liu, eds., Metamaterials. Thory, Design, and Applications, 
Springer, 2010. 

Concentrates on the recent progress in metamaterials, in particular the optical trans¬ 
formation theory, invisible cloaks, new type of antennas and optical metamaterials. 

• W. Cai and V Shalaev, Optical Metamaterials. Fundamentals and Applications, Springer, 
2010. 

Details recent advances on optical metamaterials from fundamental aspects to current 
implementations. 

• F. Capolino, ed ., Metamaterials Handbook. Vol. 1: Theory and Phenomena of Metama¬ 
terials, Vol. 2: Applications of Metamaterials, CRC Press, Boca Raton, 2009. 

A very broad collection of topics covered by many authors. 

• L. Solymar and E. Shamonina, Waves in Metamaterials, Oxford University Press, 2009. 

Offers a comprehensive treatment of all aspects of research in this field at a level 
that should appeal to final year undergraduates in physics or in electrical and electronic 
engineering. 

• S. A. Ramakrishna and T. M. Grzegorczyk, Physics and Applications of Negative Refrac¬ 
tive Index Materials, SPIE and CRC Press, 2009. 

Discussion of principles of negative refraction and comparison with other media that 
exhibit similar properties. 

Appendix 17A: MATLAB listings 


In Table 17.1 we provide a list of MATLAB files (in fact, only one file) created for Chapter 17 
and a short description of its function. 



Table 17.1 

List of MATLAB functions for Chapter . i 

Listing 

Function name 

Description 

17A.1 

ChenFigi.m 

Reproduces Fig. 3 from the paper by Chen et al. [18] 
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Listing 17A.1 Program ChenFig3.m. 


7, File name: ChenFig3.m 

7. Matlab code to plot Fig.3 of Chen et al 
7. using equivalent circuits approach 
7. Plot Fig3. of Chen et al, JAP,v. 100, 024915 (2006) 
clear all 

7. Fundamental constants 

epsilon_zero=8.8542d-12; ’/, epsilon zero (F/m) 
mu.zero = 4*pi*10~-7; l mu zero (H/m) 

7. 


7. Geometrical dimensions, 
r = 2.0; 
a = 5.0; 
l_vert =1.0; 

L_c = 1.0; 
t = 0.8; 
d_c = 0.2; 
eps_r = 4.0; 
sigma = 10"-6; 


se Fig.l of Chen 
7. internal radius = 2 mmm 
7. distance between elements = 5 mm 
7. vertical separation = 1 mm 
7. Dimension of capacitor = 1 mm 
7. Dimension of capacitor = 0.8 mm 
7. Dimension of capacitor = 0.2 mm 
i, permittivity of the gap 
7. resistance per unit length 


7. Intermediate parameters 

F = pi*r"2/(a~2) ; 7. fractional volume (dimensionless) 

C_g = (epsilon_zero*eps_r*L_c*t/d_c)*10~-3; °l gap capacitance unit (F) 


L = (mu_zero*pi*r~2/l_vert)*10~-3; 
7. pause 

R = 2*pi*r*sigma; 

C = C_g/2.0; 


7. Eg. (2) unit (H) 


7, resistance of the ring 
7. total capacitance in the loop 


7. Determination of frequency range 
N_max = 200; 

freq_GHz = linspace(0,10,N_max); 
freq_Hz = freq_GHz*10~9; 
omega_Hz = 2.0*pi*freq_Hz; 


7. number of points for plot 
7. in the interval [0,10] GHz 
7. creation of frequency arguments 


tt = l./(omega_Hz.~2*L*C); 
ttt = R./(omega_Hz.*L); 
temp = l./tt + li*ttt; 


7. effective permeability without coupling M, Eq. (9) of Chen 
extra =F./(1. +F-tt+ li*ttt); 


mu_eff = 1. - F./(1. + F - tt + li*ttt); 
result_re = real(mu_eff); 
result_im = imag(mu_eff); 
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% 

’/o plot(freq_GHz,result_im) ; 
i, pause 

plot(freq_GHz,result_re,freq_GHz,result_im,’ . ’LineWidth’,1.5); 
axis([0 10 -20 20]) 

xlabel(’Frequency (GHz)’,’FontSize’,14); 
ylabelCEffective permeability’,’FontSize’,14); 

set (gca,’FontSize’, 14) ; °/ 0 size of tick marks on both axes 

grid 

pause 

close all 
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Appendix A Basic MATLAB 



In this Appendix we discuss basic elements of MATLAB, including code design and 
specific examples of programming in MATLAB. Some introductory books are: MATLAB 
Programming for Engineers by Chapman [ 1 ], MATLAB. An Introduction with Applications 
by Gilat [2] and Mastering MATLAB 5. A Comprehensive Tutorial and Reference by 
Hanselman and Littlefield [3] - a reference book with a broad range of examples. 
MATLAB can operate in two modes: 

1. from the command window (interactive mode), where one introduces commands at 
MATLAB prompt 

2. from the m-file (script). All commands are written in a file (with *.m extension). Such 
files are activated (run) by typing their names (without an extension) at MATLAB 
prompt. Each *.m file can call several other *.m files. 

We would always advocate using m-files. They can be reused, combined with other 
programs, etc. In short, all code can be used in both ways. 

The Appendix is divided into several sections, each dealing with different aspects of 
MATLAB. In the code provided, we put many comments describing its workings. We 
suggest that in each case the reader runs the code and analyses its outputs. 

In Table A.l we provide a list of MATLAB files created for this Appendix A and a short 
description of each function. 

Before we start a more detailed description of MATLAB and its rules, we will introduce 
and run sessions dealing with fundamental MATLAB issues. 



Table A.l 

List of MATLAB functions for Chapter . j 

Listing 

Function name 

Description 

A.l 

intro session.m 

Preliminary use of MATLAB 

A.2 

memory.m 

Compares execution times for various memory handling 

A.3 

loops.m 

Compares execution times for various loops 

A.4 

basic J.D _plot.m 

Basic 2D MATLAB plot 

A. 5 

sub-plots.m 

Introduces subplots 

A. 6 

pview.m 

3D plot 

A.7 

filejwrite.m 

Writes to a file 

A. 8 

file .read.m 

Reads from file 

A.9 

deriv.m 

Numerical differentiation using MATLAB function dif f () 
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A.1 Working session with m-files 


Here, we will illustrate preliminary use of MATLAB using the m-file. The full code named 
intro s ession.m is shown below. We suggest run this code and observe the results. Try also 
to modify it and analyse the outputs. We put many comments inside the code to explain its 
operation. 

7. File name: intro_session.m 

7.-Initialization- 

clear all 

’its me’ l outputs text on screen 

pause 

d=6; % assigns value 6 to variable d 

C= [1 2 3; 3 2 1; 4 5 6] defines matrix C 
C 

who V, lists defined matrices 

pause 

whos 7. lists the matrices and their sizes 

pause 

7. -Colon operator- 

x =C( : , 3) 7o selects third column of C matrix 

pause 

format long % shows variables on a screen in a long format 

7. format short 
t = 0:0.1:2; 

t l lists value of t 

pause 

t’ 7o transpose operator, creates vertical vector 

pause 

l -Special values and special matrices- 

pi 7o shows the value of pi 

li 

Inf 

clock % year, month, day, hour, minute, seconds 

pause 

date 7 0 date in string format 

eps 7, the smallest number on my computer 

pause 

A = zeros (4) '/, creates 4x4 matrix consisting of zeros 

B = zeros(4,3) creates 3x4 matrix consisting of zeros 

pause 

A1 = ones(3) % creates 3x3 matrix consisting of ones 

B1 = ones(4,3) % creates 3x4 matrix consisting of ones 
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G = ones(l,5) 7. creates vector, length 5 consisting of ones 

pause 

D = eye (3) 7. creates identity matrix 

z=input(’input z’) ; 

z 7. shows inputed number 

pause 

7.-Prints a short table of cos values- 

clc 7, clears screen 

n = 21; x = linspace(0,1,n); y = cos(2*pi*x); 
disp(’ ’) 

disp(’k x(k) cos(x(k))’) 

disp( *-’) 

for k=l:n 

degrees = (k-l)*360/(n-l); 

fprintf(’ 7.2. Of 7.5.Of 7.8.3f \n’,k,degrees,y(k)); 

end 

disp( ’ ’); 

disp(’x(k) is given in degrees.’) 
pause 

7.-Output Options- 

a = 1.23456789; 
format short 

a 7. display a in short format 

pause 

format long 

a 7. display a in long format 

disp(a) 7. another display of a in long format 

fprintf(’Its me,\n 7.4.24f ’, a) 

pause 

7.-Data files- 

save data_l x y; 7. as *.mat file 

pause 

load data_l; 
z=x’ 

save data_5.dat z /ascii; 
pause 

7.-Scalar and array operations- 

clear all 
2+5 

A= [1 2 3; 3 2 1; 4 5 6]; B=[2 3 4; 4 5 6; 5 6 7]; 

7. 

C1=A+B 7. addition of two matrices 

C2=A*B 7. matrix multiplication 
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C3=A.*B % element by element multiplication 

C4=A./B % element by element division 

C5=A.~B % element by element exponentiation 

pause 


A.2 Basic rules 


Several of the special symbols used in MATLAB are listed in Table A.2. All variables 
in MATLAB are treated as matrices. Each operation is therefore considered as a matrix 
operation and as such has been optimized for vector and matrix operations. For the best 
use, one should vectorize algorithms and loops and also reserve in advance memory for all 
objects (matrices). We summarize basic rules and illustrate them with preliminary examples. 

• vectorization of loops. This means replacement of for and while loops by proper vector 
or matrix notation. 

Suppose that we need to generate 101 values of sin function in the interval from 0 to 1. 
It can be done in two ways: 

1. using for loop (no vectorization) 

tO = clock; for n = 0:0.001:1 
x = sin(n) 

time_no_vect = etime(clock,tO) 

Run time on my computer is time no vect = 0.2300. 

2. applying vectorization as shown below 

tO = clock; 
n = 0:0.001:1; 


j Table A.2 Some ofthespedal symbols used in MATLAB. j 

Symbol 

Description 

B 

assigning value 

[] 

construction of an array 

+ 

array addition 

- 

array subtraction 

.* 

array multiplication 

* 

matrix multiplication 

./ 

array right division 


transpose operator 

% 

beginning of a comment 
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x = sin(n); 

time_vect = etime(clock,tO); 

Now, run time on the same machine is timejvect = 0.0810. In order to determine 
time difference between start and end points, we used MATLAB function etime, which 
accurately computes time differences over many orders of magnitude. 

• reserve memory. 

• use colon (:) notation 

• limit using for in loops 

• use m-files with functions instead of scripts 

• provide extensive comments 

• create descriptive names for variables 

• use dot (.) notation 

Some of the above rules will be discussed now in more detail. 


A.3 Some rules about good programming in MATLAB 


Here we summarize several general rules of programming including those specific to 
MATLAB (but not solely). 

1. Always comment your program, so another person (even yourself after several months) 
can understand it. 

• use meaningful variable names, 

• use emits consistently in your formulas or (better) work with dimensionless (or scaled) 
variables. 

2. All calculations in MATLAB can be either performed in the Command Window or ni¬ 
fties can be created containing the same source code as typed in the Command Window, 
m-files play the role of functions or subroutines. 

3. A semicolon at the end of all MATLAB statements suppresses printing the assigned 
values on the screen, in the Command Window. This speeds up program execution. 

A.3.1 Preallocate memory 


Always preallocate memory for the generated matrix. It increases execution speed because 
MATLAB does not need to increase matrix size after determining each element. Run 
and experiment with program memory res.m shown below. A significant difference in the 
execution time can be observed. 

% File name: memoryMSW.m 

% comparison of execution time with and without memory reservation 
clear all 

7 , regular loop without memory allocation 
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7. we use functions tic and toe to measure elapsed time 
N = 10000; 

tic; 7. start timer 

n = 0; 

for x = 1:N 

n = n + 1; 

y(n) = cos(2*pi*x); 

end 

toe 7. measure elapsed time since last call to tic 

7. 

7. the same loop with preallocated memory 
tic 
n = 0; 

y =zeros(l,N); 
for x = 1:N 

n = n + 1; 

y(n) = cos(2*pi*x); 

end 


A.3.2 Vectorize loops 


MATLAB has been designed for vectorial operations. They execute much faster than regular 
loops. The comparison of both methods is illustrated in the program loops, m shown below. 
We use MATLAB functions tic and toe to measure elapsed time. 

7. File name: loops.m 

7. comparison of execution time for regular loop and vector loop 
7. regular loop is executed first 

7. we use functions tic and toe to measure elapsed time 
tic; 7. start timer 

n = 0; 

for x = 1:0.1:1000 
n = n + 1; 
y(n) = cos(2*pi*x); 

toe 7. measure elapsed time since last call to tic 

7. 

7. the same loop is vectorized below 
tic 

x = 0:0.1:1000; 
y = cos(2*pi*x); 
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A.4 Basic graphics 


Very good quality graphics is one of the reasons for MATLAB’s popularity. Here we 
summarize the basic rules. 


A.4.1 Basic 2D plot 


A basic plot is produced in two steps: 

1. Creating two arrays of equal lengths, one containing values of independent variable x 
and the other containing dependent variable y = fix). 

Creation of an array of values of independent variable can be done in several ways. Below 
we created arrays of points ranging from 0 to 2n using different methods. 

xl = 0: pi/100:2*pi; 7 100 evenly spaced points 

x2 = [0:0.25:2*pi] ; 7. predefined spacing of 0.25 

x3 = linspace(0,2*pi,100); 7o 100 evenly spaced points 

2. Inputting created arrays x and y into the MATLAB plot function. 

The steps are illustrated in the MATLAB program below. 


clear all 

x = 0:pi/100:2*pi; 

y = cos(2*x); 

plot(x,y) 

pause 

close all 


’/„ clears all variables in memory 

*/. creates array containing independent variable 

°/„ creates array containing function values 

7. creates generic plot, without description 

*/. pause, to view the plot 

7, closes all figures 


A.4.2 Two-dimensional plots 


The main function used here is plot. It can be called up with different number of arguments 
plot(y) plot(x,y plot(x,y,’line_type’) 

Here, we illustrate several ways to use it. 

Listing: Basic 2D plot. 

Here we show creation of the simplest two-dimensional plot by plotting function y = 3x, 
adding basic description and grid. The graph generated by basic 2D plot.in file is shown 
in Fig. A.l. 

MATLAB code is shown below. It explains creation of a basic two-dimensional graph. 

7. File name: basic_2D_plot.m 

7, Basic plotting of MATLAB are explained. 

clear all 

7. Generate data for plot 

n = 100; 7. number of plotting points 

x = linspace(0,10,n) ; 7. generates points on x-axis 
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Plot of function 3*x 





Basic two-dimensional plot. 


y = 3*x; “/, function to be plotted 

7. Creation of basic plot and adding description 

h = plot(x,y); % basic plot 

pause "/, stop to analyse plot 

titleC’Plot of function 3*x’, ’FontSize’ , 14) % adding title 

xlabel(’x value’ , ’FontSize’ , 14)adding text on x-axis and size of x label 

ylabelC’y value’,’FontSize’, 14)adding text on y-axis and size of y label 


pause(2) 
grid 

set(h,’LineWidth’,1.5); 
setfgca,’FontSize’,14); 


"/, stop for 2 seconds 
7. adding grid 

"/, new thickness of plotting lines 
7, new size of tick marks on both axes 


pause 7, final stop 

close all 7. closing all windows 

Listing: Creating subplots. 

Function subplot (2,2,1) divides the Figure window into 2 vertical and 2 horizontal 
windows and places the current plot into window 1 (top, left). In the remaining code other 
types of plots are created and placed in the remaining subwindows. 

The results of sub.plots.m are shown in Fig. A.2. MATLAB code is shown below. It 
explains the creation of subplots. 

7. File name: sub_plots.m 

i Creation of subplots with different properties, 
clear all 

7. Generate data for plot 
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0 5 10 10” 1 10° 10 1 

Creation of subplots with different properties. 


*/. number of plotting points 
"/. generates points on x-axis 
"/, function to be plotted 
% division of Figure window 
"/„ basic plot shown in top-left window 
% size of tick marks on both axes 
"/. stop to contemplate effects 
•/. The above function is now plotted on different log scales 
•/. and plots are placed in the remaining sub-windows 
subplot(2,2,2) 

’/. log scale on x-axis 
’/. size of tick marks on both axes 


n = 100; 

x = linspace(0,10,n); 

y = 3*x.~2; 

subplot(2,2,1) 

hi = plot(x,y); 

set(gca,’FontSize’, 10) 

pause 


h2 = semilogx(x,y); 
set(gca,’FontSize’,14) 
subplot(2,2,3) 
h3 = semilogy(x,y); 
set(gca,’FontSize’,15) 
h4 = loglog(x,y); 
set(gca,’FontSize’,16) 


"/„ log scale on y-axis 
subplot(2,2,4) 

"/, log scale on both axes 
% size of tick marks on both axes 


°/ Below we set new thicknesses of plotting lines 
set(hi,’LineWidth’,1.5); set(h.2,’LineWidth’,2); 
set(h3,’LineWidth’,2.5); set(h4,’LineWidth’,3.5); 
pause 
close all 


Types of lines (both colours and symbols) are summarized in Table A.3. Colours are 
automatically selected by MATLAB. 
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A.4.3 Some 3D plots 


We show how to put several 2D plots on one graph and make it look like a 3D plot. The 
code is shown below and the generated graph is as Fig. A.3. 

’/, File name: pview.m 

i Allows multiple 2D plots to be stacked next to one another 
’/„ along one dimension; also provides 3D view of all plots 
clear all 

x = linspace(0,3*pi).’; % x-axis data 

Z = [sin(x) sin(2*x) sin(3*x) sin(4*x)]; 

’/, Code below gives each curve different value on y-axis 
Y = [zeros(size(x)) ones(size(x))/3 (2/3)*ones(size(x)) ones(size(x))]; 
plot3(x,Y,Z,’LineWidth’,1.5) 
grid on 

xlabel(’x’,’FontSize’ , 14) 
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ylabel(’y’,’FontSize’,14) 
zlabel(’z’,’FontSize’,14) 

set(gca,’FontSize’,14); ’/, size of tick marks on both axes 

view(-40,60) 

pause 

close all 


A.5 Basic input-output 


A.5.1 Writing to a text file 


To save the results of some calculations to a file in text format requires the following steps: 

a) Open a new file, or overwrite an old file, keeping a ‘handle’ for the file. 

b) Print the values of expressions to the file, using the file handle. 

c) Close the file, using the file handle. 

The file handle is just a variable which identifies the open file in your program. This 
allows you to have any number of files open at any one time. 

% File name: file_write.m 
°/o open file 

fid = fopen(’myfile.txt’, ’wt’) ; °/„ ’wt’ means "write text" 

if (fid < 0) 

error(’could not open file "myfile.txt"’); 
end; 

for i=l:10 °/„ write to file 

fprintf (fid,’Number = °/ 0 3d Square = °/ 0 6d\n’ , i, i*i) ; 
end; 

fclose(fid); °/„ close the file 

A.5.2 Reading from a text file 


To read some results from a text file is straightforward if you just want to load the whole 
file into the memory. This requires the following steps: 

a) Open an existing file, keeping a ‘handle’ for the file. 

b) Read expressions from the file into a single array, using the file handle. 

c) Close the file, using the file handle. 

The fscanf() function is the inverse of fprintf(). However, it returns the values it reads as 
values in a matrix. You can control the ‘shape’ of the output matrix with a third argument. 

'/, File name: file_read.m 
’/, open file 
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fid = fopen(’myfile.txt’, ’rt’) ; '/, ’rt’ means "read text" 
if (fid < 0) 

error(’could not open file "myfile.txt"’); 

’/. read from file into table with 2 rows and 1 column per line 
out = fscanf (fid,’Number = ”/,d Square = "/,d\n’ , [2, inf] ) ; 
fclose(fid); '/, close the file 

xx = out’; '/, convert to 2 columns and 1 row per line, 

xx '/, output results to screen 


A.6 Numerical differentiation 


We will illustrate numerical differentiation using MATLAB function diff(x). It computes 
the difference between elements in an array. 

Given a vector x = {..in, x k , x k+ \...) containing points x k , one can derive approxima¬ 
tions to the first derivative f{x) = ^ of the function y = f (x). Three basic possibilities 
are [4] 


Backward difference, 


(A.l) 


x k ~ X k -1 

Forward difference, 


(A.2) 


x k+1 - X k 


Central difference, 

f l(x ^ _ f(x k +\) - f(x k -\) 

(A.3) 


Xk +1 - Xk -1 

To compute first derivative we use MATLAB function diff(x), which determines the 
difference between adjacent values of the vector x. The derivative is then determined as 


dy = diff(y)./diff(x); 


The returned array of differences contains one less element than the original array. One 
should observe that trimming the last value ofx produces a forward difference and trimming 
the first value gives a backward difference. 

To obtain the second derivative, we apply the above algorithm a second time. Using 
backward difference, one has 


f"(x k ) 


f(x k ) - 
x k - x k -\ 


(A.4) 


“/, File name: deriv.m 

•/. Program evaluates first and second derivatives and plots results 
clear all 
font_size = 18; 

N_max = 190; 

x = linspace(0,2,N_max); 
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yl = sin(pi.*x); 

hi = plot(x,yl); % plot of original function 

xlabel(’x’,’FontSize’,font_size); 
ylabel(’Original function’,’FontSize’,font_size); 
grid on 
pause 
% 

tempi = yl; 

dyl = diff(tempi)./diff(x); 
xnewl = x(l:length(x)-l); 

h2 = plot(xnewl,dyl); % plots first derivative 

xlabel(’x’ , ’FontSize’ ,font_size) ; 
ylabel(’First derivative’,’FontSize’,font_size); 
grid on 
pause 
% 

temp2 = dyl; 

dy2 = diff(temp2)./diff(xnewl); 
xnew2 = xnewl(1:length(xnewl)-l); 

h3 = plot(xnew2,dy2); % plots second derivative 

xlabel(’x’,’FontSize’,font_size); 
ylabel(’Second derivative’,’FontSize’,font_size); 
grid on 

close all 


A.7 Review questions 


1. How are variables represented in MATLAB? 

2. What is the role of the semicolon (;) operator in MATLAB? 

3. What does MATLAB function linspace( 0,1, 10)? 

4. What is the role of dot (.) operator in MATLAB? 

5. What does MATLAB function subplot (3, 2, 1)? 

6. Perform a polar plot. 

7. Estimate n using the random method. 
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In a book like this, the development of computer programs for various tasks and also 
execution of simulations for different processes and devices, plays an essential role. The 
fundamentals of many computer programs are supported by numerical methods. Therefore, 
in this Appendix we summarize main elements of numerical analysis with an emphasis on 
methods related to the development of programs used in this book, and also to understanding 
of operation of those programs. 

There are many excellent textbooks devoted to numerical analysis. We found the books 
by Koonin [1], DeVries [2], Garcia [3], Gerald and Wheatley [4], Rao [5], Heath [6] and 
Recktenwald [7] of significant pedagogical value. The books by Press et al. [8] stand on 
their own as an excellent source of practical computer codes ready to use. 

We concentrate on description and implementation of some practical numerical methods 
and not on the problems which those methods are typically used for. We start our discussion 
with a summary of methods of solving nonlinear equations. 

There are many textbooks aimed to the introduction of numerical methods and their ap¬ 
plications. Some of the most popular are: Applied Numerical Analysis Using MATLAB by 
Fausett [9], Numerical Methods for Physics by Garcia [3], Introduction to Scientific Com¬ 
puting by van Loan [10], Advanced Engineering Mathematics with MATLAB by Harman 
et al. [ 11 ], A Friendly Introduction to Numerical Analysis by Bradie [12]. 

They contain extensive code written in MATLAB (sometimes also in other languages) 
which should help with understanding and could be easily adopted to a particular problem. 

Our aim here is to summarize some of the numerical methods and techniques which are 
directly relevant to the problems discussed in the main text. 

In Table B.l we provide a list of MATLAB files created for this Appendix B and a short 
description of each function. 


B.l One-variable Newton's method 


The very efficient and popular method, also known as Newton-Raphson method, is used to 
find roots. Here we establish its principles by showing how to find root x* for an arbitrary 
function of single variable /(x) such that 


fix*) = 0 


(B.l) 
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Table B.1 List of MATLAB functions for Appendix . 


Listing Function name Description 


B.l muller.m 

B.2 dmuller.m 

B.3 fmuller.m 

B.4 ode single.m 

B.5 func-ode.m 

B.6 func-odexsys.m 

B.7 odesys.m 

B.8 square-wave.m 

B.9 FT ^example _1. m 

B.10 FT-example d2.m 


Function implements Muller’s method 
Driver to test Muller’s method 
Test functions for Muller’s method 
Driver for RK method (single eq.) 
Function called by odesingle.m 
Function called by odesys.m 
Driver for RK method (system of eqs.) 
Implements Fourier series for square wave 
Preliminary Fourier transform 
Testing Fourier transform in MATLAB 


M 



Graphical illustration of Newton's method. 


The Newton method is based on a linear approximation of the function by employing a 
tangent to this function at a particular point. In Fig. B. 1 we provide graphical illustration 
of Newton’s method. 

One starts from an initial guess point, say x\, which should be not too far from root x*. 
Evaluating tangent at point jci and its intersection with the x axis creates new point X2 and 
the process is repeated. The relevant mathematical formulas are obtained by writing the 
Taylor expansion of / (x) around initial guess xi: 

fix*) = f{x i - Sx) = fix i) - ^ Xl) <$x + D(5x 2 ) (B.2) 

ax 

In the previous, /(x*) = 0. Solving for <5x, one finds 


fix l) 


From Fig. B. 1 one observes that point X2 can be obtained 


X 2 = XI — <5x = 


fix l) 
fix i) 


(B.3) 


The procedure can be generalized and for arbitrary step n, one has 


fiXn) 

fix n Y 


n= 1,2,... 


(B.4) 


i+i ~ 








422 


Summary of basic numerical methods 


B.2 Muller's method 


Muller’s method is an efficient technique of finding both real and complex roots of a 
scalar-valued function /(x) of a single variable x where there is no information about 
its derivatives. It uses quadratic interpolation involving three points [13]. The method is 
described in several textbooks, including Atkinson [14] and Fausett [9], 

In order to illustrate how the method works, let us start with three initial guesses xq, xi , x 2 . 
By applying Muller’s method, a new improved approximate point X3 is determined. In this 
method one uses quadratic function y(x) which is used in fitting original function /(x). 
Function y(x) is of the form 

y(x) = a (x - x 2 ) 2 + b (x-x 2 ) + c (B.5) 

It contains three constants a, b, c which can be determined by evaluating Eq. (B.5) at points 
xq, x\, x 2 . One obtains 


for x = x 2 

fix 2 ) = C 


for x = Xi 

fix 1) = a (Xi -X2) 2 +b (xi 

~x 2 )+c 

or 

fix\) ~ fix 2 ) = a (x 1 -x 2 ) 2 

+ b (x 1 - x 2 ) 

for x = x 0 

/(x 0 ) = a (x 0 x 2 ) 2 + b (x 0 

-x 2 ) + c 

or 

fix 0) - f(x 2 ) = a (x 0 - x 2 ) 2 

+ b (x 0 - x 2 ) 


Introduce notation tio = xq — x 2 and u\ = x\ — x 2 , which allows us to write the above 
equations as 

fix 0) - f(x 2 ) = aul + buo (B.6) 

f(x l )-f(x 2 ) = au 2 l +bu 1 (B.7) 

The system of Eqs. (B.6) and (B.7) can be solved by any standard method and one obtains 
_ U\ [/(*0) - /fe)] - up [fix 1) - fix 2)\ 

(x 0 - X 2 ) (xi - x 2 ) (x 0 - Xi) 

b _ «o [fjx\) - fix 2)] - u\ [/(X 0) - fix 2)] 

(*0 - X 2 ) (X! - X 2 ) (x 0 - xi) 

With the above information at hand and using Eq. (B.5), one can determine roots of 


yix ) = 0 


in order to generate a new approximation of the root of /(x). One obtains 


x-x 2 = 


— b± \ft? — 4 ac 
2 a 


(B.8) 


Out of the above two roots we choose the one which is closest to x 2 . To avoid round-off errors 
due to subtraction of nearly equal numbers, one multiplies numerator and denominator of 
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(B.8) by —b =F *Jb 2 — 4ac and obtains (we call new solution x by x 3 ) 


*3 “*2 


(—b ± V* 2 — 4 acj (—b =p Vb 2 — 4acj 

2a ( — b =p Vb 2 — 4acj 


—b ± y/b 2 — 4 ac 


(B.9) 


Once x 3 is determined, the ‘oldest’ point xo is ignored by setting xo = xi, xi = X2, X2 = x 3 
and the process is repeated. MATLAB implementation of Muller’s method using the above 
algorithm is shown below. 


“/, File name: muller.m 

function out = muller(f, xO, xl, x2 , epsilon, max) 

Finds zeroes using Muller’s method, good for complex roots. 
’/, Variable description: 

’/, out - result of search 

’/, xl,x2,x3 - previous guesses 
’/, epsilon - tolerance 
y max - max number of iterations 


yO = f(xO); 
yl = f(xi); 
y2 = f(x2); 
iter = 0; 

while (iter <= max) 

iter = iter + 1; 

a =( (xl - x2)*(y0 - y2) - (xO - x2)*(yl - y2)) / ... 

( (xO - x2)*(xl - x2)*(xO - xl) ); 

l 

b = ( ( xO - x2 )~2 *( yl - y2 ) - ( xl - x2 )‘2 *( yO - y2 )) / ... 
( (xO - x2)*(xl - x2)*(x0 - xl) ); 

c = y2; 

y 

if (a ~= 0) 

disc = b*b - 4*a*c; 
ql = b + sqrt(disc); 
q2 = b - sqrt(disc); 
if (abs(ql) < abs(q2)) 
dx = - 2*c/q2; 


dx = - 2*c/ql; 

end 

elseif (b ~= 0) 
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x3 = x2 + 
xO = xl; 
xl = x2; 
x2 = x3; 


dx; 


yO = yi; 
yl = y2; 
y2 = f(x2); 

1 

if (abs(dx) < epsilon) 
out = x2; break; 


end 

Driver for the Muller method is shown below: 

% File name: dmuller.m 

% Driver to test Muller’s method 

clear all 

format short 

max = 100; 

epsilon = le-6; 

% starting points 
xl = 0.5; 
x2 = 1.5; 
x3 = 1.0; 

% call to Muller’s method 

out = muller(Ofmuller, xl, x2, x3 , epsilon, max) 

Test functions for the Muller method are shown below. One can make appropriate choices. 

% File name: fmuller.m 
function out = fmuller(x) 

% Test functions for Muller’s method 
out = x. ~6 - 2; 

'/.out = x.~10 - 0.5; 

°/,out = x - x."3/3; 

°/ 0 out = x.~3 - 5*x.“2 + 4*x; 


B.2.1 Tests of Muller's method 


Using the above functions, we tested Muller’s algorithm on two functions: 

1. calculations of \fl, after Fausett [9]. We define function fix) = x 6 — 2, and assume 
starting values to be: xq = 0.5, x\ = 1 .5, *2 = 1 .0. Obtained solution is x — 1 . 1225. 
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2. finding root of f(x) = e* + 1 = 0. The analytical method gives x = in. Assuming 
starting values xq = 1, x\ = 0, x 2 = —1, one obtains from Muller’s method x = 
-0.0000 + 3.1416 i. 


B.B Numerical differentiation 


Very often scientific and engineering problems are formulated in terms of differential 
equations. To solve them numerically, we need to establish schemes to replace derivatives 
by finite differences. 

A typical scientific or engineering problem is described by the second-order differential 
equation. Examples include Newton’s second law mx = F, or Laplace’s equation V"(x) = 
0. General second-order differential equation is written as 


d 2 y(x) , dy{x) ... 

, , + a(x )—— = b(x) 

ax 1 ax 


with a (x) and b(x) known functions. The above can be rewritten as two first-order equations: 


dyjx) 

dx 


z(x) 


- = b(x) - a(x)z(x) 


where z(x) is a new variable. The process can be generalized for a differential equation 
of arbitrary order, say n — th order, and when applied, results in n first-order differential 
equations, as 


- =f(x,yi,y 2 , 


where the functions f on the right-hand side are known. 

In order to have a well-defined mathematical problem, the above set of equations must 
be supplemented by boundary (or initial) conditions. Boundary conditions are algebraic 
conditions on the value of the functions y, to be satisfied at discrete specified points, but 
not between those points. 

The (first) derivative of a function / (x) is defined as 


df (x) I 
dx L 


= /'(+>) 


fix) - f(.X 0 ) 
x-x 0 


For a finite (and nonzero) value of Ax = x—xq, the above derivative can be approximated 


fix o) 


fix) ~ fjxp) 

Ax 


In Table B.2 we summarize common finite-difference formulas. 
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Table B.2 Summaryof common finite-difference formulas. 


Type of approximation 

Formula 

Truncation error 

Forward differences 

r fm-f 

Jl h 

f +2 -2f i+l +f 

Ji h 2 

OQi) 

Backward differences 

f-f l 
'• h 

f — 2f-\ + f-2 

Ji h 2 

0(h) 

Central differences 

r! f +1 f— 1 

Jl ~ 2h 

f+i ~ If + f-i 

Ji h 2 

0(h 2 ) 



B.3.1 Numerical differentiation using Taylor's series expansion 

We want to determine a derivative of function / (x) at point x, and also its second derivative. 
Using notation from Fig. B.2, we write Taylor expansions 

fi+x = ft + hf! + hi 2 f!' + h 3 fl" + 0(h A ) (B. 10) 

f-i = f~ hf! + hi 2 f!' - h?f!" + 0(h A ) (B. 11) 

where the following notation has been used: 

f n = f{x n ), x n = n-h, n = 0,±1,±2,... 

All numerical schemes defining the first derivative are derived from the above equations. 
From Eq. (B.10) we obtain the forward formula 

_ f+i-f + 
h 

From Eq. (B. 1 1) we obtain the backward formula 

fl = f + 0(h ) 
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Subtracting both, we obtain the central formula 

r, _ f+i ~ f—i 
Ji ~ 2 h 

Taylor expansion for next points (i ± 2) gives 

f+2=f + 2 hfl + i (2 h) 2 f[' + 1 (2hf f" + Of/r 4 ) (B.12) 

and 

1-2 =f~ 2 hf’ + ~ (2h) 2 f" - l (2h) 3 f!" + 0{h A ) (B. 13) 

2 o 

Multiplying Eq. (B.10) by 2 and adding to Eq. (B.12), we obtain 

1+2 ~ 2f m = f + 2hfl + 2h 2 f" - 2/ - 2hf! - h 2 f," + 0(h 3 ) 

= ~f + h 2 f" + 0(h 3 ) 

or 

ft, _ f+2 ~ 2f + \ + f 

Ji ~ h 2 

Other schemes can be obtained in a similar way. 

B.3.2 Numerical differentiation using interpolating polynomials 


To illustrate a basic concept, consider a second-order polynomial /(x) passing through 
three data points, Fig. B.3: 


(Xi, f) , (Xi+uf+l) , (x i+2 ,f +2 ) 
The analytical form of that polynomial is 


f(x) = a 0 + a\x + a 2 x 1 


(B.14) 












428 


Summary of basic numerical methods 


Function / (x) has to pass through all three points defined previously. This requirement 
creates three equations: 

fi = «0 + CL\Xi + a 2 f 
Jm = ao + ai (Xi + h) + a 2 (x t + h) 2 
f +2 = ao + ai (xj + 2 h) + a 2 (x t + 2hf 

The above equations will now be solved to find coefficients ao, ci\, a 2 . The scheme must 
be independent of the choice of point x,-, so we can choose x, = 0, which significantly 
simplifies algebra. One obtains 

ft = ao 

f+l = a 0 + aih + a 2 h 2 
f+ 2 = ao + 2a\h + 4 a 2 h 2 

The solutions are 


ao = f 

-f +2 + 4f +l -3f 


a\ m - 


a 2 = - 


2 h 

f+ 2 ~ 2f+i + f 
2 h 2 

Differentiate Eq. (B.14) and use Eqs. (B.15) to obtain 

-f+2 + 4f+i - 3ft 


2 h 


fi = f(xi = 0) = «i = - 

/? = f\x, = 0) = la 2 =&iff±* 

The above are forward-difference approximations of the order 0(h 2 ). 

Given a vector x = {... x*-i> Xt, xt+i ...} containing points x*, one can derive approxi¬ 
mations to the first derivative //(x) = ^ of the function y = f(x). Three basic possibilities 
are [1]: 


Backward difference, 

f'(x k ) 

_ f(x k ) - f(pCk- 1 ) 

(B.16) 


X k - x k -l 


Forward difference, 

f(x k ) 

_ f(Xk+\) - f{x k ) 

(B.17) 


X k +x —X k 

Central difference, 

_ f(Xk+\) - f(x k -\) 
f(x k ) « 

(B.18) 


To compute first derivative we use MATLAB function diff(x), which determines the 
difference between adjacent values of the vector x. The returned array of differences 
contains one less element than the original array. One should observe that trimming the 
last value of x produces a forward difference and trimming the first value gives a backward 
difference. 
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To obtain the second derivative, we apply the above algorithm a second time. Using the 
backward difference, one has 


f"(Xk) 


/'(**) ~ f'(Xk- 1 ) 
x k - in 


(B.19) 


B.3.3 Crank-Nicolson method 


We illustrate the method on the diffusion equation, which we write in the form 
3 u 3 2 u 
3 1 dx 2 


(B.20) 


The Crank-Nicolson (CN) method is the finite-difference discretization of the above equa¬ 
tion. It was invented to overcome stability limitations and to improve the rate of convergence 
of the solution of diffusion equation. It has O (At) 2 convergence rate compared to O (At) 
of the implicit and explicit methods. 

The CN method is essentially an average of the implicit and explicit methods. It is 
created as follows. Applying a forward difference approximation to the time derivative in 
the diffusion equation gives the explicit scheme 


(Ax) 2 


When using backward difference to the same equation, one creates the implicit scheme 


u" +l - m" wTf/ - 2m? +1 + m" + / , 

+ O (At) = ,+1 ' i ~ 1 + O (Ax) 2 


(Ax) 2 


The algebraic average of the two schemes gives 
u" +l - w” if u n i+ j - 2 m" + m"_j 


< +1 -o“«t 1 -2< +1 +< 1 1 ) = 


- Ki 


h O (Ax) 1 
(B.23) 


From the above equation one finally obtains the Crank-Nicolson scheme 


-2m? + < 1 ) 


where a = One can show (see Problems) that the scheme is accurate to O (At) 2 

rather than O (At). Note that in CN scheme m"+/, m" +1 , m"^ 1 are determined implicitly in 
terms of u" +1 , u ", M"_ r 


B.3.4 Simple methods of numerical differentiation 
Euler method 

The simplest method is the Euler method, which is 


y n+ 1 =y n + hf(x n ,y n ) 
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which advances a solution from x n to x n+ \ = x„ + h. It is not a practical method, because it 
is not very accurate and also not very stable (see discussion below). 

Let us illustrate several methods which originate from the Euler method. We will use 
definition of an acceleration in one-dimensional motion as an illustration. It is 


d 2 x{t) 

dt 2 


a(x, v) 


In general, acceleration can depend on position and velocity. First, write the above equation 
as a system of two first-order differential equations: 


— = a(x, v) (B.25) 

dt 

dx _ 
dt ~ V 


Introduce r as a time step. The right-derivative formula for a general function g(f) is 


dg(t) _ g(t + r) - g(t) 
dt x 

Application of that formula to a system (B.25) gives 

v(f + r)-v(0 +O(T ) = 

*q + T)-,(0 +O ( T ) = 


+ O(X) 

a(x(t),v(t)) 

v(0 


We have explicitly indicated the dependence of all quantities involved on the specific time. 
From the above 


v(t + x) = v(t) + x ■ a(x(t), v(t)) + 0( x 2 ) 
x(t + t) = x(t) + x • v(f) + 0{x 2 ) 


For the future use, let us introduce the following notation: 

gn = g((n — l)r), n= 1,2,3,... (B.26) 

and 

gi = g(t = 0) 

In the above, n refers to time steps. In our notation, the Euler method for the acceleration 
problem reads 


The algorithm based on Euler method goes as follows: 

1. specify initial conditions (i.e. values corresponding to t = 0) x\ and Vi 

2. choose time step x 

3. calculate acceleration for current values of x n and v„ 
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4. use Euler method to compute new x and v 

5. loop through step 3, until all time steps are done. 

An implementation of the Euler method is left as a problem. 

Euler-Cromer (E-C) and mid-point methods 

Both of these methods are simple modifications of the Euler method just discussed. E-C 
method consists in replacing v„ by v„ + i, the value of velocity at the next time step (it must 
be calculated first, so the calculations must be done in a proper sequence). Explicitly, the 
E-C method is 


Xn+ 1 = x„ + t ■ v„+i 

Another possible modification (and in a similar spirit) results in a mid-point method: 
v»+i = v„ + r • a„ 
x n+ i = x n + r ■ ^(v« + v„+i) 

where the last term represents algebraic average of velocities at present and the next time 
steps. 


Leap-frog method 


The development of this method starts with Eqs. (B.25). Use the central derivative approach 
to write (note that central time value is different for both equations) 


T( , +t) - T( ,_ I) +o(t2) 

,(, + 2Q— 

Applying notation (B.26), the above system reads 


= a(x(t}) 
= v(f + r) 


or 


v„ + i = v„_i + 2t ■ a(x n ) 
x n+1 = x„ + 2x ■ v„+i 

In order that this scheme works, we need to know values at proper initial points. Note 
that position and velocity are evaluated at different points, as 

position is evaluated at points: x \, X3, X5,... 
velocity is evaluated at points: V2.V4.V6,... 
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Verlet method 


To develop the Verlet method, instead of using Eqs. (B.25), we will write them as [3] 
dx 
dt 

d 2 x _ 
dt 2 

Use central difference formulas for first and second derivatives and obtain 



OC T 2 ) = V„ 
0 ( r 2 ) = 


From above, we obtain formulas which define Verlet method: 



x„+i = 2x n - x n —i + x 2 a n 


B.4 Runge-Kutta (RK) methods 


B.4.1 Second-order Runge-Kutta 


Our goal is to solve linear system of linear differential equations 

^ = ?<v w ,o 

with vector x expressed as x = [X1X2 .. .x n \. In order to establish second-order RK 
method, let us start with the simple Euler approximation to the first derivative (which is 
also Taylor expansion to first order): 

■?(f + T) = -?(0 + T.7o?(f),f) 

where x is some arbitrary time step and t < £ < t + x. We have freedom to choose £ 
to have the most convenient value for us. The second-order RK method originates when 
£ = t + r/2 and also 

~x {t + x) = ~x (t) + x -~f (~x*(t + x/2), t + x/2) 


-f*(t + x/2 ) = -? (0 + 1/2 • r • ( t), t) 


where 








433 


Solving differential equations 


B.4.2 Fourth-order Runge-Kutta 

This is one of the best schemes available and is commonly used in practice. It is defined as 
~x(t + t) = ~Z{t) + 1/6 • x ■ (P\ + iP2 + 2^3 + ^ 4 ) 

where 

= 7ct,t) 

?2 = JC? + T/2-? U t + T/2) 

^ 3 = 7ct+x/2-? 2 ,t+x/2) 

^ 4 = 7cZ + T-~F 3 ,t + T) 

MATLAB has some built-in routines implementing RK methods. 

B.5 Solving differential equations 


Here we provide some practical methods of solving ordinary differential equations (ODE), 
both single and systems, using MATLAB function ode45.m. 

B.5.1 Single differential equation 


To start with, consider an equation with an initial condition 

-^ = x(t) 0 - 1^ , x(0) = 0.009 (B.27) 

We want to solve it and plot the solution in the interval [0.1, 10]. MATLAB code is divided 
into two m-files: one (the driver) contains all needed parameters and calls the appropriate 
m-function which defines the function. 

"/» File name: ode_single.m 

"/» Illustrates application of R-K method to a single dif f. equation 
7. Results are compared with an exact solution 

I. 

clear all 

tspan = [0.1 10]; 7 0 time interval 

xO = 0.009; 7„ initial value 

7. 

[t,x] = ode45( , func_ode_single’,tspan,xO); 
t_ex = linspace(0.1, 10, 100); 

x_ex = t_ex. ~2. *exp(-t_ex) ; 7o Exact solution 

plot(t,x,t_ex,x_ex,’.’,’LineWidth’,1.5) 







434 


Summary of basic numerical methods 



xlabel(’t’,’FontSize’,14) 
ylabel(’x’,’FontSize’,14) 

set(gca,’FontSize’,14); “/, size of tick marks on both axes 

pause 

close all 

The function itself is 

function xdot = func_ode_single(t,x) 

’/, Function called by ode_single.m 
xdot = x*(2/t -1); 

The result of solving the above problem is plotted in Fig. B.4 along with an exact solution, 
which is x(t) = t 2 e~ l . 


B.5.2 System of differential equations 


To illustrate application of ode45.m function in solving systems of differential equations, 
consider the Lorenz model [3] defined by the following system: 


dx i (Q 
dt 

dx 2 (t ) 
dt 

dx 3(f) 
dt 


a (xz-Xi) 
rx i — x 2 — X1X3 
xix 2 — bx 3 


(B.28) 


where cr,r,b are positive constants. This system was originally developed to describe 
buoyant convection in a fluid. It shows chaotic behaviour. The following initial conditions 
are assumed: xj (t) = l,x 2 (t) = 1, X3 (t) = 10.04. The analysis is conducted in the time 
interval [0, 10]. That system of equations is coded as 
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function xdot = func_ode_sys(t,x) 
"/. Function called by ode_sys.m 


sigma = 10; b = 7/3; r = 25; °/„ parameters 


xdot(1) = sigma*(x(2) - x(l)); 
xdot(2) = r*x(l) - x(2) - x(l)*x(3); 
xdot(3) = x(l)*x(2) - b*x(3); 
xdot = xdot’; 

The MATLAB code of the driver of the above system is 
°/o File name: ode_sys.m 

7„ Illustrates application of R-K method to system of diff. equations 

7. 

clear all 

tspan = [0 10] ; 7« time interval 

xO = [1, 1, 10.04]; 7. initial value 

7. 

[t,x] = ode45( , func_ode_sys ) ,tspan,xO); 

plot(t,x(:,1), t,x(:,2), t,x(:,3),’LineWidth’,1.5) 

xlabel(’time’,’FontSize’,14) 

set(gca,’FontSize’, 14) ; 7o size of tick marks on both axes 

pause 

close all 

The results of two runs using the above functions are shown in Fig. B.5. 

B.6 Numerical integration 


The problem is to calculate a numerically definite integral 

/ = J f(x)dx 

The interval [a, b ] can be split into equidistant subintervals with total number N = 
with h separation between neighbours. The above integral is thus replaced by 

I = f f(x)dx = p f(x)dx + p f{x)dx + ■ ■ ■ = J2 A i 

where A, is the area of an elementary element. In the following, we therefore only consider 
an integral within a single subinterval: 

Ai = J f(x)dx 

Different schemes will originate depending on how we evaluate the above integral. 


(B.29) 
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Solution of a system of ordinary differential equations showing chaos for two different values of initial conditions; 
[1,1,10.04] (top) and [1,1,10.02] (bottom). 


B.6.1 Euler's rule 


Subarea is defined within an interval x;_i < x, < x,-+i. The integral (B.29) thus takes the 
form 

At = jT f(x)dx « h-f 

with fi = f (Xi) taken at the right end of subinterval. The value of the function can also be 
taken at the left point, or in fact at any point within the subinterval xi- \ < Xt < x,+i. Total 
area is thus 

7 * 


= h-(f x +f 2 +■■■/„) 
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B.6.2 Trapezoidal rule 

In this method the integral is approximated by a series of trapezoids. The area of each 
subinterval is evaluated as 

At = J f (x)dx « (fi—i + f ) • (Xj - x f - 1 ) 

with h=Xi~ Xj—[ . Taking h as a constant, the integral is 

I = j f(x)dx +fi)‘ h 

= 2 ^ T C/o + /l ) + (/l + fl) + C/2 + h) H-+ (/n-2 + fn- 1) + C/h— 1 + /n)} 

= ^ {/o + 2/! + 2/2 + ... 2/„_i + /„} 

In a condensed form, the integral is 

I = hJ2f+\h(fo + f n ) 


B.6.3 Simpson's rule 


Simpson’s method gives more accurate results than trapezoidal rule. Trapezoidal rule con¬ 
nects consecutive points by a straight line; Simpson’s rule connects three points. Normally, 
a second-order polynomial (a parabola) is used to do so. Consider one: 

p(x) = c 2 x 2 + C\X + Co 

Without loss of generality, we can take x, = 0. Then, x,_i = —h and x,-+1 = h. Evaluating 
the above polynomial gives 

p(-h) = c 2 h 2 - c x h + c 0 = f- 1 
P( 0) = c 0 = ft 
p(h ) = c 2 h 2 + c\h + c 0 = fi+\ 

Solving the above system gives 
co 
c\ 


= f 
= 


■ft- 1) 


C2 = ^ (/+!+/-I -2/) 
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Evaluating elementary integral for an area A, gives 



The complete integral is 



B.7 Symbolic integration in MATLAB 


MATLAB allows us to perform symbolic manipulations. It uses Maple, a powerful com¬ 
puter algebra system, to manipulate and solve symbolic expressions. To use symbolic 
mathematics, we must use the syms operator to tell MATLAB that we are using a symbolic 
variable, and that it does not have a specific value. 

In this book we only use symbolic manipulations to evaluate indefinite integral. We 
illustrate symbolic concepts of MATLAB with the example below. 


Example Evaluate integral of the function fix) = x 2 . 

The shortest method is to type on a prompt: >> int(x 2). The answer is am = jc3/3. 
Alternatively, we can define x symbolically first, and then remove the single quotes in 
the int statement. 

» syms x 
» int(x~2) 


ans = 


x~3/3 


B.8 Fourier series 


According to Fourier, any function can be expressed in the form known as the Fourier 
series: 



+ sinnf 


(B.30) 
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Fourier series representation is useful in describing functions over a limited region, say 
[0, T\, or on the infinite interval (— 00 , + 00 ) if the function is periodic. We assume that 
the Fourier series converges for any problem of interest, see Arfken [15]. Cosine and sine 
functions form a complete set and are orthonormal on any 2tc interval. Therefore they may 
be used to describe any function. The orthogonality relations are 


f 


sin mt sin nt dt = 


cos mt cos nt dt = \ 
sin mt cos nt dt = 0, 


J nS m ,„, mf=0 

(O, m = 0 

7T 8 m , n , m± 0 

2 7t, m = n = 0 

all integral m and n 


(B.31) 

(B.32) 

(B.33) 


One also has the relation 


(B.34) 


Using the above orthogonality relations, the coefficients of expansion in Eq. (B.30) are 

1 r 2n 

a„ = — I fit) cos nt dt (B.35) 

n Jo 


and 


b„ = ~ f fit ) s 
™ Jo 


(B.36) 


Also, the exponential form of the Fourier series is often used where the function is expanded 
as 


/«= E c,el" 


(B.37) 


Comparison of the above relations gives formulas for coefficients c n 
co= Jao n = 0 

c n = \ ia„ — ib „) n > 0 

c- n = | {a\ n \ + ib\»\) n < 0 


(B.38) 


B.8.1 Change of interval 


In the above formulation, the discussion has been restricted to an interval [0,2;r]. If the 
function / it) is periodic with the period 2 T, we may write 


where 


/ ix) = -cio + ^ a„ cos nt + b„ sin nt 

a n =—f fit') cos 1 ^~ dt', n = 0,1,2, 3,. 
fi-r T 


(B.39) 

(B.40) 
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.fit) 


T 2 T 


and 

b n =\j T fit') n= 1,2,3,... (B.41) 

Expressions (B.30) and (B.39) are related by obvious change of variables, namely t = y-. 
The choice of the symmetric interval [-T, T] is not essential; in fact, for periodic function 
with a period 2 T it is possible to choose any interval [f 0 , to + 27*]. 

B.8.2 Example 


Use the Fourier series to approximate a square wave function /(f) as shown in Fig. B.6. 
Using Heaviside step function H (f), the function / (t) can be written as 


where 


/<0 = 2[tf(t)-H(t-l)]-l 


H{t) = 


0 

1 


t < 0 
t > 0 


Determine coefficients a„ and b n in the Fourier series expansion. Write a MATFAB program 
which plots the above approximation for various number of terms in the series. 

From Fig. B.6 one observes that f(t) = f (2 T — f); i.e. function / (t) is odd. Therefore 
coefficients a„ vanish. Coefficients b„ are evaluated as follows: 

fc, = i f m ™ (f) ™ ("f) d,-\ f" sin (^) dt = 

= — J sin xdx - J sin xdx 

n odd 
n even 

The Fourier series is 


m = - E 

^ Jxs,.. 


( njrt\ 

"(t-J 


The MATFAB program which evaluates the above is shown overleaf. The result is shown 
in Fig. B.7. 
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Fourier series approximation to the square wave. Line with oscillations corresponds to N = 20 terms; the other one 
was obtained using 100 terms. 


7, File name: square_wave.m 
clear all 
L = 0.5; 

t = [0:100]/100; 
hold on 

for N = [20 100] ; 

f = zeros(1,101); 
for n = 1:2: N 

pref = 4/(pi*n); 
f = f + pref*sin(n*pi*t/L); 


7. arguments of independent variable 

7. function initialization 

7. evaluate prefactor 
7. summation of function 


plot(t,f,’LineWidth’,1.5) 

end 

axis([-0.2 1.2 -1.2 1.2]) 
ylabel(’f(t)’,’FontSize’,14) 
xlabel(’t’,’FontSize’,14) 

set(gca,’FontSize’, 14) ; 7. size of tick marks on both axes 

pause 

close all 


B.9 Fourier transform 


Based on the results from previous section, we now introduce Fourier transform (FT). 
Consider function /(/) to be nonperiodic on the infinite interval. FT is introduced by 
considering first a Fourier series representation (B.30) of the function which is periodic on 
the interval [0, T\, where T is the period. First, make substitution 
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in Eq. (B.37) and obtain 

fit) = Cn Cn = J J f {t) e ~ in ^ dt 

Next, introduce discrete angular frequencies 

71 

0)n = n f 

The difference between two successive frequencies A co is evaluated as 


Using the above definition, the original series given by Eq. (B.42) can be written a 

fit) = c ” e>ntAa ’ J ^ fit)e~ intAw dt 


Introducing new function g by the relation 


(B.44) 


one obtains 


ginAco) 


' I" 
*f2jr J-t 


fit) e 


(B.45) 


and 


fit) = 


sffjt 


AcoginAco) e mtAo> 


(B.46) 


Taking the limit T —*■ oo, summation becomes an integral and nAco becomes continuous 
variable co. One finds 

g(oj) = -J= J fit) e~ m, dt (B.47) 

and 

fit) =^f °°gi<o) e io>t dt (B.48) 

The above relations are known as the Fourier transform and inverse Fourier transform. 
There is a freedom in choosing prefactor in Eq. (B.44). By making different choices, one 
can introduce different Fourier transforms. 



FFT in MATLAB 


B.10 FFT in MATLAB 


Fast Fourier transform (FFT) in MATLAB is defined as 


X(k) = '^ j x{n)e- iln(k - VHn - X)IN . 1 <k<N 


(B.49) 


which corresponds to the analytical definition (B.47) hut without -^= factor. FFT in MAT¬ 
LAB is performed by the function fft.m. 

The inverse FFT (IFFT) which is performed by MATLAB function ifft.m is defined in 
MATLAB as 



(B.50) 


k= 1 


and it corresponds to the analytical definition (B.48). Again, the - 7 == factor is absent. Let 
us illustrate a simple application of FT in MATLAB. 


Example 1 As the simplest illustration of fft.m and ifft.m functions, we provide below 
MATLAB code to conduct FT and its inverse for a Gaussian pulse. In the end, one recovers 
original Gaussian pulse. 

°/ 0 File name: FT_example_l .m 

°/ 0 Doing preliminary Fourier transform in Matlab 

N = 320; 

T_0 = 10; 

T_domain = 200; 

Delta_t = T_domain/N; 

T = Delta_t*(-N/2:1:(N/2)-l); 

A_in = exp(-(T/T_0) . "2) ; “/, input gaussian pulse 

plot(A_in) °/ 0 plot of input gaussian pulse 




A_freq = fft(A_in); 

plot(A_freq) 

pause 

orig = ifft(A_freq); 
plot(orig) 
pause 
close all 


7, not a correct pulse 

7. (I explain it in the next example) 


7. plots original input gaussian pulse 


Example 2 As the previous example shows, Fourier transform in MATLAB must be done 
carefully. In this example we illustrate the workings of FFT in MATLAB. MATLAB code 
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y = exp(-x 2 ) F = ffl(y) 




Original Gaussian pulse (left) and and its FT (right). 


is shown in the following. First, a Gaussian pulse is created and plotted against a number 
of points along the x-axis. When needed, one can also plot it against x values. At the top of 
each figure (as a title) we provide an expression for the plotted function. 

As the second figure, we plotted normalized Fourier transform using MATLAB function 
fftQ. The plot does not resemble the expected result because the Fourier transform 
produces a complex function of the frequency. Vector F contains complex numbers and 
when you plot it in MATLAB, it plots the real part against the imaginary part. 

To correct for the above deficiency, in the third figure we plotted the above transform 
plus the MATLAB abs{ ) function. Altogether, both functions produce the desired result, 
i.e. magnitude of the Fourier transform of the Gaussian pulse. 

In the remaining two figures we performed the inverse Fourier transform of the created 
Fourier transform. 

Observe the use of function fftshift.m which puts the negative frequencies before the 
positive frequencies. The reason for using it is that when MATLAB function fft.m computes 
FFT, it outputs the positive frequency components first and then outputs the negative 
frequency components. 

Below we show full MATLAB code for this exercise and all the figures created starting 
with an initial Gaussian pulse. In Figs. B.8-B.10, we show Gaussian pulse in time and its 
Fourier transform. It was obtained with the following MATLAB program. 

’/, File name: FT_example_2.m 

’/, Testing Fourier transform in Matlab 

clear all; 

x=-4:0.1:4; °/, Interval of independent variable 

y=exp(-x.~2); % Define Gaussian function 

plot(y,’LineWidth’,1.5); '/. Plot Gaussian (Fig.l) 

xlabelf’Number of points’,’FontSize’,14); ylabel(’y(x)’,’FontSize’,14); 

set(gca,’FontSize’,14); % size of tick marks on both axes 
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abs(F) FT of a Gaussian: ffishifit(abs(F))) 




The next steps in the application of Fourier transform to the Gaussian pulse. After application of function absQ (left), 
and / ftshift() (right). 


ifft(abs(F)) Inverse FT: fftshift(ifFt(abs(F))) 




Steps in producing inverse Fourier transform of an original Gaussian pulse. After application of if f t () (left), and 
after application of fftshift(ifftO) (right). 

title(’y=exp(-x~2)’); 
pause 

t 

F=fft(y); % Calculate FFT 

plot(F,’LineWidth’,1.5); % Plot of FT (Fig.2) 

title(’F=fft(y)’); 

S 

plot (abs(F) , ’LineWidth’ , 1.5) ; ’/, Plot of abs value (Fig.3) 
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title(’abs(F)’); % Take absolute value and plot 

l 

plot(fftshift (abs(F)), ’LineWidth’, 1.5) ; "/, Use Matlab fftshift function 

xlabelC ’Spatial frequency’); ylabelC ’ IFI ’); "/, (Fig.4) 
title(’FT of a Gaussian: fftshift(abs(F)))’); 
pause 

l - 

"/, Take inverse Fourier transform 

/.- 

plot(ifft(abs(F)),’LineWidth’,1.5); % Use Matlab function ifft (Fig.5) 

title(’ifft(abs(F))’); 
pause 
1 

plot (fftshift (if ft (abs(F))),’LineWidth’, 1.5) ; "/, Use functions fftshift 
title(’Inverse FT: fftshift(ifft(abs(F)))’); % and ifft (Fig.6) 

xlabel(’number of points’); ylabel(’y(x)’); 

close all 


B.11 Problems 


1. Use Newton’s method to find roots of the following function: e* — 1 — cos3x. Plot it 
first and use the graph to determine the starting point used by Newton’s method. 

2. Verify orthogonality relations, Eqs. (B.35) and (B.36) (where the limits of integration 
are [0, 2 nj) for another limit, i.e. [—tt, n], 

3. Derive expressions for coefficients a„,b„, c n using new limits. 

4. Show that the Crank-Nicolson scheme is accurate to O(At) 1 2 3 . 

5. Use the 2D version of Euler method to analyse projectile motion [3], 

6 . Analyse the effect of scaling within Fourier transform in MATLAB. 
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absorption 
coefficient, 181 

in a two-level system, 168,169 
infrared, 109 

of power in photodetectors, 242 
spectrum, 243 
ultraviolet, 109 

acceptance angle, see critical angle 
active region, 173, 176 
in a VCSEL, 173 
air mass, 369 
amplifier gain, 206 

amplifier, erbium-doped fibre (EDFA), 
209-13 

typical characteristics, 213, 215 
amplifier, semiconductor (SOA) 
Fabry-Perot (FPA), 223 
travelling-wave (TWA), 223, 227 
antireflection (AR) coating, 50-1, 205 
half-wave layer, 53 
quarter-wave layer, 53 
array waveguide grating (AWG), 318 
asymmetry parameter, 67 
attenuation, see loss 

band gap, 243 
in solar cells, 376 
bandwidth 
3-dB, 207 

of a photodetector, 247 
of anNRZ signal, 144 
of an optical amplifier, 207 
of an RZ signal, 143 
of gain, 206,224 
Bessel functions 

modified, 113, 114, 129, 132, 133 
ordinary, 113, 129, 131, 132 
bit error rate (BER), 240, 252-7, 259, 
336 

bit-pattern independency, 242 
bit-rate transparency, 241 
boundary conditions 

absorbing (ABC) in ID, 272-5 
absorbing (ABC) in 2D, 277-9 
for a magnetic field, 37-8 
for an electric field, 36-7 
Mur’s first order, 272-3 


Bragg mirror, 53, 54, 62 
reflectivity spectrum, 57, 62 
Brewster’s angle, 49-50 

C-band, see transmission bands 

generation rate, 182 
leakage rate, 183 
lifetime, 184 
recombination rate, 183 
chirping, 192-3 
cloaking, 398-9 
coefficient of finesse, 29 
coordinate transformation, 398 
Courant-Friedrichs-Levy (CFL) stability 
condition, 270 

Crank-Nicolson scheme, 301, 302 
critical angle, 18, 30, 64, 106 
cross-gain modulation (XGM), 234-5 
cross-phase modulation (XPM), 233, 
235 

bias, 144 
dark, 246 

forward diode, 372 
injection, 143 
leakage, 182 

photocurrent, 242, 245, 371, 374 
reverse saturation, 374 
short-circuit, 369 
threshold, 143 
current density, 35 
cutoff wavelength 

in optical fibres, 122-3,128 
in photodiodes, 245 

Debye medium, 280 
depletion region, 370 
detection 
coherent, 240 
incoherent (direct), 240 
diffraction, 396 
dispersion 

equation of the waveguide, 66 
group velocity (GVD), 106, 124 
in free space, 24 
material, 124-8, 136, 280 
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Index 


modal, 124 
multipath, 108 
numerical, 269, 270, 283 
of a ID wave, 21 
of a Debye medium, 280 
of a Lorentz medium, 280 
of a pulse in optical fibre, 127-8 
waveguide, 124-127 
distributed Bragg reflector (DBR), 173, 

174 

Drude relation, 389 
dynamic range, 241 

E-band, see transmission bands 

effective index method, 89-92 

effective medium, 391, 394 

effective thickness of a slab waveguide, 77 

EH modes, 116, 117, 119 

electric 

boundary conditions, 36-37 
field intensity, 35 
flux, 35 

electron, see carrier 
emission 

amplified spontaneous (ASE), 215, 248, 
255 

spontaneous, 168, 169, 208 
stimulated, 168,169 
equivalent circuit, 193 
for bulk laser, 194-6 
for PIN photodetector, 245,250 
for solar cell, 373-6, 380 
error function, 231 
Euler differentiation method, 429-31 
Euler’s rule (integration method), 436 
Euler-Cromer differentiation method, 431 
evanescent waves, 396 
excited state, 168 
external modulator, 6, 193 
eye diagram, 337 

Fabry-Perot (FP) 

interferometer, 29-30 
resonance conditions, 171 
resonant modes, 171, 172 
resonator, 170 
Faraday rotator, 323 
Fast Fourier transform (FFT), 443-4 
Fermi golden rule, 180 
Fermi level, 176 
fibre Bragg grating, 318 
filter function, 258 
finite difference method, 425-9 
backward differences, 426 
central differences, 426 
forward differences, 426 
finite-difference (FD) approximation, 297-9 


length, 19,20 
plane, 19, 20 
point, 19 

four-wave mixing (FWM), 233, 352 
Fourier series, 438-9 
change of interval, 439—40 
of a square wave, 440-1 
Fourier transform, 441-2 

of a chirped Gaussian pulse, 142 
of a Gaussian pulse, 141 
of a rectangular pulse, 139, 140, 158 
of an electric field, 146 
Fresnel 

coefficients and phases for TE polarization, 44-7 
coefficients and phases for TM polarization, 

47-8 

reflection, 45, 47 
Fresnel approximation, 290 

gain 

approximate formula, 206 
differential, 178 
inanEDFA, 213 
in semiconductors, 177-81 
peak, 178,180 
ripple, 227, 237 
saturation, 190, 207, 216,234 
spectrum of semiconductor laser, 172 
Gaussian probability density function (pdf), 255 
generation of carriers 
in laser didoes, 183 
in PIN photodiodes, 244 
in solar cells, 371 
geometrical optics, 17-21 
Goos-Hiinchen (G-H) shift, 58-9, 397 
graded index, 20, 107 
grid, see mesh 
GRIN system, 20-1 
group 

delay, 124-5,148 
velocity, 32 
guides modes 
in optical fibres, 114 
in slab waveguides, 65-6 

HE modes, 116, 117, 119 
Helmholtz equation, 288, 289, 292, 299 
hole, see carrier 
hybrid modes, 116, 119 

impedance, 41 
interference 

in dielectric films, 26-7 
intersymbol (ISI), 255 
multiple, 27-9 

invisibility cloak, see cloaking 
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L-band, see transmission bands 
large-signal analysis (in laser diodes), 192 
laser diode 

distributed feedback laser (DFB), 173,174 
in-plane laser, 172 

vertical cavity surface-emitting laser (VCSEL), 
172, 173 

leap-frog differentiation method, 431 
left-handed materials (LHM), see negative index 
materials (NIM) 

lens 

perfect, 396 
thin, 19 

linear system, 338 

linewidth enhancement factor (Henry factor), 228 
Lorentz medium, 280 

extrinsic, 1 10 
in optical fibres, 108, 129 
intrinsic, 109-10 
LP modes, 119-20 

Mach-Zehnder interferometer (MZI), 235 

magic time step, 270 

magnetic 

boundary conditions, 37-8 
field intensity, 35 
flux, 35 

Maxwell’s equations, 262 
differential form, 35 
in cylindrical coordinates, 111-12 
integral form, 36 
source-free, 69, 388 

ID generation algorithm, 102 
staggered grid, 266 
Yee grid in ID, 266, 267 
Yee grid in 2D, 276 
mid-point differentiation method, 431 
mode number 
azimuthal, 116 
radial, 116 
modulation format 

non-retum-to-zero (NRZ), 143-5 
retum-to-zero (RZ), 143-5 
modulation of semiconductor lasers, 143, 144 
modulation response function, 188, 190-2 
Muller’s method, 100, 422-5 

negative index materials (NIM), 385, 386 
Newton’s method, 420-1 

Gaussian, 255-7 
in optical amplifiers, 208-9 
in photodetectors, 248-9 
shot, 249 

thermal (Johnson), 249-50 


nonlinear Schrodinger equation, 353-7 
normalized guide index, 67, 94 
Numerical aperture (NA), 65, 106-7 

O-band, see transmission bands 
optical black holes, 399-400 
optical cavity, 168 
Fabry-Perot, 170 

optical communication system, 331-3 
optical coupler, 319-22 
optical fibre 
bit-rate, 108 
extrinsic loss, 110 
intrinsic loss, 109-10 
modes, 116-17 
single-mode, 122-3 
optical isolator, 322-3 
optical splitter, 319-20 

p-n junction 

double heterostructure, 176-7 
homogeneous (homojunction), 175-6 
multijunction, 376-7 
paraxial approximation, 288-92 
permeability, 35 

negative, 386, 391-5 
permittivity, 35 
negative, 387, 389-91, 395 
plasmon-like, 386 
photocurrent, 245 
in photodiodes, 242 
in solar cells, 370, 371, 374 
photodiode 

avalanche (APD), 242, 248, 258 
metal-semiconductor-metal (MSM), 
246 

PIN, 242, 244 
power, 252 
sensitivity, 241,257 
photon 

density, 183 
lifetime, 183, 184, 187 
Planck’s radiation law, 169 
plane wave, 40-1 
plasma, 390 
frequency, 390, 391 
Poisson distribution, 249, 252,253 
polarization 
circular, 42-3 
elliptical, 42-3 
linear, 42 

transverse electric (TE), 44 
transverse electric (TM), 47 
polarization of dielectric medium, 148 
population inversion, 167,204,208 
power budget, 333—4 
Poynting vector, 59-60 
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propagation constant, 41, 42 
in slab waveguides, 66, 95 
pulse broadening, 12, 151 
in optical fibres, 106 
pulse half-width, 142 
pulse type 

chirped Gaussian, 141-2, 159 
Gaussian, 139^10, 159,163,164,281-6 
rectangular, 138-9, 157, 340-2, 344 
Super-Gaussian, 140-1, 159, 162 
pulse wave, see waveform 
pumping, 167, 168 

quantum efficiency, 247 
quantum well, 173, 232 
quasi-Fermi level, 175, 176 

rate equations in EDFA, 211 
rate equations in laser diodes 
for an electric field, 184-7 
for carriers, 182-3 
for photons, 183 
parameters, 184 
ray optics, 17 
in metamaterials, 389 
in slab waveguides, 64-9 
Rayleigh scattering, 110 
receiver, 331, 343 
recombination of carriers, 183 
reflection 

at a plane interface, 17,18 
coefficient, 17, 45,46, 48 
external, 48,49 
internal, 48 

of TE polarized waves, 44-7, 61 
of TM polarized waves, 48, 61 
total internal, see critical angle 
refractive index, 1 7 
in GRIN structures, 20 
negative, 384 

numerical values for popular materials, 18 
relative difference, 65 
relaxation-oscillation frequency, 189 
Resonant cavity, see optical cavity 
rise time, 144 
rise time budget, 333-6 
Runge-Kutta method 
fourth order, 433 
second order, 432 

S-band, see transmission bands 
Sellmeier equation, 31, 125-6, 135 
signal-to-noise ratio (SNR), 208, 248, 

336 

Simpson’s rule (integration method), 437-8 
slowly varying envelope approximation (SVEA), 150, 
296-7 


small-signal analysis (in laser diodes) 
with linear gain, 188-9 
with non-linear gain, 189-92 
Snell’s law, 18 
soliton 

interactions, 363 
period, 362 
spectral 

intensity of solar energy, 368, 369 
responsivity, 247 
split-ring resonator (SRR), 391-4 
split-step Fourier method, 357-60 
steady-state analysis 
in a laser diode, 187 
in an EDFA, 211-12 
step index, 107 
Stokes relations, 24-6 
susceptibility, 186 

TE modes 

in optical fibres, 116-18 
in slab waveguides, 71-2 
three-level system, 210 
threshold 

carrier density, 178, 182 
current, 143 

time constant in photodetectors, 246, 247 
time division multiplexing (TDM), 11 
time-harmonic field, 39—41 
TM modes 

in optical fibres, 116-18 
in slab waveguides, 71-2 
train of pulses, see waveform 
transatlantic telecommunications cable (TAT), 6 
transfer function, 339 
transfer matrix approach 

for antireflection (AR) coatings, 51-3 
for Bragg mirrors, 54-7 
for slab waveguides, 79-85 
transitions 

in a two-level system, 169-70 
in semiconductors, 174-5 
transmission bands, 10 

transmittance of Fabry-Perot interferometer, 29, 33 
transmitter, 331, 342 
transparency density, 178 
transparent boundary conditions, 304—6 
transverse resonance condition, 66-7 
normalized form, 67-9 
trapezoidal rule (integration method), 437 
two-level system (TLS), 167, 169 

U-band, see transmission bands 
velocity 

group, 23—4, 124, 126 
phase, 21-3, 32 

Verlet differentiation method, 432 
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wave equation, 38-9 
for TE modes, 72 
for TM modes, 72 
in cylindrical coordinates, 112 
in metamaterials, 388 
waveform, 145, 160 
waveguide 
2D, 88-92 

asymmetric slab (planar), 75-9, 95 
cylindrical (optical fibre), 110-23 
lossy, 86 


symmetric slab (planar), 72-5 
wavevector, see propagation constant 
weakly guiding approximation (wga), 
118-19 

wire medium, 395 

Y-junction, 14, 325, 327 
Yee algorithm 

lossless in ID, 266-8, 282 
lossless in 2D, 275-7, 285 
lossy in ID, 271-2 



